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METHODS AND COMPOUNDS FOR MODULATING NUCLEAR RECEPTOR ACTIVITY 
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this invention. 

INTRODUCTION 
Technical Field 

10 ■ The present invention relates to improved methods and compounds for modulating 

nuclear receptor activity. In particular, the present invention relates to improved methods and 
compounds for modulating estrogen receptor activity. 

Background 

Cells contain receptors that can elicit a biological response by binding various molecules 
15 including proteins, hormones and/or drugs. Nuclear receptors represent a super family of 
proteins that are hormone/ligand-activated transcription factors that enhance or repress 
transcription in a cell type-, ligand- and promoter-dependent manner. A classic nuclear receptor, 
the estrogen receptor a (ERa) is a key factor in regulating the differentiation and maintenance of 
neural, skeletal, cardiovascular and reproductive tissues (Korach, Science 266:1524-1527 
20 (1994);Smith,etal.,lto£llgl.I-M£d.331:1056.l061 (1994)). The nuclear receptor family 
also includes receptors for glucocorticoids, androgens, mineralocorticoids, progestins, thyroid 
hormones, vitamin D, retinoids* peroxisome proliferators and eicosanoids. A subset of the 
nuclear receptor family are the steroid receptors, which include the estrogen, glucocorticoid and 
progestin receptors. 

25 Overall sequence conservation between nuclear receptors varies between different 

families of receptors; however, sequence conservation between functional regions, or modules, 
of the receptors is high. For example, nuclear receptors can be organized into functional 
modules comprising an N-terminal transcriptional activation domain, a central DNA binding 
domain (DBD), and a C-terminal ligand binding domain (LBD). The LBD of nuclear receptors 

30 represents a hormone/ligand-dependent molecular switch, and recognizes a variety of 

compounds diverse in their size, shape and chemical properties. Accordingly, the estrogen 
hormones exert their physiological effects by binding to the estrogen receptor (Beato, et al., Cell 
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83(6):85 1-857 (1995); Tsai. et al.. Annu . Rev . Biochem . 63:451-86 ( 1 994)). Binding of a 
hormone lo a nuclear receptor's LBD also changes its ability to modulate transcription of DNA, 
although they may have transcription-independent actions.. 

All ERct ligands bind exclusively to the C-terminal LBD. Some of these ligands. 
5 including the endogenous estrogen. 1 7p-estradiol (E 2 ), and the synthetic nonsteroidal estrogen, 
diethylstilbestrol (DES), function as pure agonists whereas others such as ICI- 164384 function 
as pure antagonists. Synthetic ligands such as tamoxifen and raloxifene (RAL) belong to a 
growing class of molecules known as selective estrogen receptor modulators (SERMs), which 
function as antagonists in specific tissue and promoter contexts (Grese, et al.. Proc. Natl . Acad . 

10 Sci. USA 94:14105-10 (1997)). The remarkable tissue-specific behavior of tamoxifen was 

recently demonstrated in a breast cancer prevention trial, reported in Smigel. J. Natl . Cancer Inst . 
90:647-8 (1998), where a group of women at high risk for breast cancer who received tamoxifen 
treatment over a six year period, exhibited an increased incidence of endometrial cancer but a 
reduced occurrence of certain bone fractures and a dramatic 45% reduction in breast cancer 

1 5 incidence. The rational design of new SERMs and the optimization of existing ones require an 
understanding of the effects of different ligand chemistries and structures upon ERa 
transcriptional activity. 

Nuclear receptors also bind proteins, such as chaperone complexes, compressors, or 
coactivators, that are involved in receptor function. In particular, ligand-dependent activation of 

20 transcription by nuclear receptors is mediated by interactions with coactivators. Receptor 
agonists promote coactivator binding and antagonists block coactivator binding. Hormone 
binding by a nuclear receptor can increase or decrease binding affinity to these proteins, and can 
influence or mediate the multiple actions of the nuclear receptors on transcription. 

Transcriptional activation by ERa is mediated by at least two separate activation 

25 functions (AFs) located within different domains of the protein, AF-1 in the N-terminus, and AF- 
2 in the LBD. These AFs can act independently or cooperatively, depending on the cell type and 
the promoter context. The activity of AF-1 is regulated by growth factors acting through the 
MAP kinase pathway (Kato. et al.. Science 270:1491-1494 (1995)) and is generally believed to 
be activated in a ligand-independent manner, while AF-2 activity ("transcriptional activity") is 

30 responsive to ligand binding (Kumar, et al.. Cell 5 1 (6):941 -95 1 (1987)). The binding of agonists 
triggers transcriptional activity whereas the binding of antagonists does not (Berry, et al., EMBO 
J. 9:281 1-8 (1990)). In addition, coactivators mediate transcriptional activity. The structural and 
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functional nature of the site to which coactivators bind has only recently been defined. AprilettL 
et al., US Provisional No. 60/079,956, filed March 30. 1998. the disclosure of which is 
incorporated herein by reference. 

Recent structural studies suggest that ligands regulate transcriptional activity by directly 
5 affecting the structure of the LBD. Comparison of the structure of the unliganded human 

retinoid X receptor a LBD (Bourguet, et al., Nature 375:377-82 (1995)) with the structures of 
the liganded LBDs of the human retinoic acid receptor y (RARy) (Renaud, et al.. Nature 
378:681-689 (1995) and Wurtz, et al., Nat. Struct . Biol . 3:87-94(1996)), the thyroid hormone 
receptor a (TRa) (Wagner, et al., Nature 378:690-697 (1995)), the progesterone receptor 

10 (Williams, et al., Nature 393:392-395 (1998)) and the ERa (Brzozowski, et al., Nature 389:753- 
758 (1997); Tanenbaum, et al., Proc . Natl . Acad . Sci- USA 95:5998-6003 (1998)) suggests that 
an agonist-induced conformational change involving the repositioning of helix 12, the most C- 
terminal helix of the LBD, is essential for transcriptional activity. Because certain point 
mutations in helices 3, 5 and 12 abolish transcriptional activity but have no effect on ligand or 

1 5 DNA binding, these regions of the LBD have been predicted to form part of a recognition 

surface, created in the presence of agonist, for molecules that link the receptor to the general 
transcriptional machinery (Danielian, et al., EMBO J. 1 1 : 1025-33(1 992); Feng, et al.. Science 
280:1747-9 (1998); Henttu, et al., Mol . Cell . Biol . 17:1832-9 (1997); Wrenn, et al., J. Biol. 
Chem. 268:24089-24098 (1993)). The structures of the LBD complexed with E 2 and RAL show 

20 that although both ligands bind at the same site within the core of the LBD (Brzozowski. et al., 
supra ), each of these ligands induces a different conformation of helix 12. Whereas helix 12 in 
the E 2 -LBD complex packs against the helices 3. 5/6 and 1 1 in a conformation that has been 
observed for the corresponding helix in other agonist-bound NR LBD structures, helix 12 in the 
RAL-LBD complex is bound in a hydrophobic groove composed of residues from helices 3 and 

25 5. This alternative orientation of helix 12 partially buries residues in the groove that are 

necessary for transcriptional activity, suggesting that RAL and possibly other antagonists block 
transcriptional activity by disrupting the topography of the coactivator binding site surface. 

Biochemical and genetic approaches have led to the identification of several proteins that 
associate in a ligand-dependent manner with ERa (Horwitz, et al., Mol . Endocrinol . 10:1 167- 

30 1 177 (1996)) including SRC-l/N-CoAl (Onate, et al.. Science 270:1354-1357 (1995)), 

GRIP1/TIF2/SRC-2 (Hong, et al., Proc . Natl . Acad . Sci. USA 93(10):4948-4952 (1996) and 
Voegel, et al. EMBO J. 15:3667-3675 (1996)), p/CIP/RAC3/ACTR/AIBl/SRC-3 (Anztck. et al.. 
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Science 277:965-968( 1 997). Chen, et al.. Cell 90(3):569-80 ( 1 997). Li. et al., Proc . Natl . Acad . 
Sci . USA 94:8479-84 (1997) and Torchia. et al.. Nature 387:677-684 (1997)) and CBP/p300 
(Hanstein, et al., Proc . Natl . Acad . Sci . USA 93:1 1540-1 1545 (1996)). These proteins have been 
classified as transcriptional coactivators because they enhance ligand-dependent transcriptional 
5 activation by ERa as well as by several other NRs (Glass, et al., Curr . Opin . Cell Biol . 9:222-32 
(1997); Torchia, et al., supra ). The observation of partial hormone resistance in mice with a 
disrupted SRC-1 gene (Xu. et al., Science 279:1922-1925(1998)) provides compelling evidence 
that coactivators are required for NR function in vivo. Consistent with its proposed role in AF-2 
directed transcriptional activation, SRC-1 possesses histone acetylase activity and the ability to 

10 interact not only with agonist-bound receptors but also with other coactivators and several 
general transcription factors (Kamei, et al., Cell 85(3):403-14 (1996); Onate, et al., supra : 
Spencer, et al., Nature 389:194-8 (1997); Takeshita, et al.. Endocrinology 137:3594-7 (1996)). 
SRC-1 and GRIP1 also bind to the agonist-bound LBDs of both the human TRP and human ERa 
using the putative coactivator binding site (Feng, et al., supra ). 

1 5 Members of the p 1 60 family of coactivators such as SRC- 1 , GRIP 1 /TIF2/SRC-2, and 

p/CIP/RAC3/ACTR/AIBl/SRC-3 as well as other coactivators recognize agonist-bound NR 
LBDs through a short signature sequence motif, LXXLL (SEQ ID NO:l) (where L is leucine and 
X is any amino acid), known as the NR box (Ding, et al., Mol . Endocrinol . 12:302-313 (1998); 
Heery, et al., Nature 387:733-736 (1997); Le Douarin, et al., EMBO J. 15:6701-15 (1996); 

20 Torchia, et al., supra ). Mutagenesis studies indicate that the affinity of coactivators for NR 
LBDs is determined principally, if not exclusively, by these NR boxes (Ding, et al., supra ): 
Heery, et al., Nature 387:733-736 (1997); Le Douarin, et al., EMBO J. 15:6701-15 (1996); 
Torchia, et al., supra ). Each of the pi 60 coactivators contains several NR boxes. The NR boxes 
within SRC-1 , GRIP1 and TIF2 have been demonstrated to recognize different NRs with 

25 different affinities (Ding, et al., supra : Kalkhoven, et al., EMBO J. 17:232-43 (1998); Voegel, et 
al., EMBO J. 17:507-19 (1998)), but the reasons for these binding preferences are unknown. 

Darimont, et al., "Structure and specificity of nuclear receptor-coactivator interactions" 
Genes Dev . 12:3343-3356 (1998) describes structural studies of the complex between TRp and 
the GRIP1 NR Box II peptide and biochemical studies of GRIP1 binding to TRp and GR. The 

30 PPARy/SRC-1 peptide complex is described in Nolte, et al.. Nature 395:137-143 (1998). 

The medical importance of nuclear receptors is significant. They have been implicated in 
breast cancer, prostate cancer, cardiac arrhythmia, infertility, osteoporosis, hyperthyroidism, 

4 
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hypercholesterolemia, obesity and other conditions. For example, compounds that modulate 
ERa transcriptional activity are currently being used to treat osteoporosis, cardiovascular disease 
and breast cancer (Gradishar. et ah, J. Clin. Oncol . 1 5:840-52 ( 1 997) and Jordan. J. Natl . Cancer 
Inst . 90:967-71 (1998)). 
5 A need continues to exist for further identification and characterization of the key 

residues within the ligand binding domains of the nuclear receptors, and molecules that affect the 
receptor by binding to these sites. Understanding these interactions provides a basis for iterative 
drug clesigri, synthesis; arid selection. It also would be advantageous to devise methods and 
compositions for reducing the time required to discover compounds that target these binding 

10 sites and administer them to organisms to modulate physiological processes regulated by the 
nuclear receptors, and the estrogen receptor in particular. 

SUMMARY OF THE INVENTION 
The present invention relates to the further identification and manipulation of the ligand 
binding domain (LBD) of nuclear receptors, which facilitates the design of compounds that bind 

15 to the LBD and modulate nuclear receptor activity, and the estrogen receptor in particular. The 
compounds include agonists and antagonists that modulate nuclear receptor activity, and can be 
receptor-, cell- and/or tissue-specific. In particular, the compounds modulate nuclear receptor 
activity by affecting coactivator-coactivatpr binding site interactions. 

The present invention also includes protein cocrystals of the nuclear receptors with an 

20 agonist bound to the LBD and a peptide bound to the coactivator binding site and methods for 
making them. Similarly, the invention also includes protein cocrystals of the nuclear receptors 
with an antagonist bound to the LBD and methods for making them. The cocrystals provide 
means to obtain atomic modeling information of the specific amino acids and their atoms 
forming the LBD and coactivator binding sites and that interact with molecules that bind to the 

25 sites. The cocrystals also provide modeling information regarding the ligand:nuclear receptor 
and coactivator:nuclear receptor interactions, as well the structure of ligands bound thereto. 

The present invention further provides methods for identifying and designing molecules 
that modulate ligand binding to a nuclear receptor using atomic models of nuclear receptors. 
The method involves modeling test compounds that fit spacially into a nuclear receptor LBD 

30 using an atomic structural model comprising a nuclear receptor LBD or portion thereof. 

screening the test compounds in an assay, such as a biological assay, characterized by binding of 
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a test compound to the nuclear receptor LBD. and identifying a test compound that modulates 
ligand binding to the receptor. 

The invention also includes compositions and methods for identifying key residues 
within the LBDs oi nuclear receptors. The methods involve examining the surface of a nuclear 
receptor of interest to identify residues that modulate ligand and/or coactivator binding. The 
residues can be identified by homology to the key residues within the LBD of human ERa 
described herein. Overlays and superpositioning with a three dimensional model of a nuclear 
receptor's LBD. and/or a portion thereof, also can be used for this purpose. Additionally, 
alignment and/or modeling can be used as a guide for the placement of mutations on the LBD 
surface to characterize the nature of the site in the context of a cell. 

Also provided is a method of modulating the activity of a nuclear receptor. The method 
can be in vitro or in vivo. The method comprises administering in vitro or in vivo a sufficient 
amount of a compound that binds to the ligand binding domain and acts either as an agonist or an 
antagonist. Preferred compounds bind to the site with greater affinity than ligands found in a cell 
of interest. 

The invention further includes a method for identifying an agonist or antagonist of ligand 
binding to a nuclear receptor. The method comprises providing the atomic coordinates 
comprising a nuclear receptor ligand binding domain or portion thereof to a computerized 
modeling system; modeling compounds which fit spatially into the nuclear receptor ligand 
binding domain; and identifying in an assay, for example a biological assay, for nuclear receptor 
activity a compound that increases or decreases activity of the nuclear receptor through binding 
the ligand binding domain. 

Also provided is a machine-readable data storage medium with information for 
constructing and manipulating an atomic model comprising the ligand binding domain or 
portions thereof. The medium comprises a data storage material encoded with machine readable 
data which, when using a machine programmed with instructions for using said data, is capable 
of displaying a graphical three-dimensional representation of a molecule or molecular complex 
for a nuclear receptor ligand binding domain. 

Also provided is a method of identifying a compound that selectively modulates the 
activity of one type of nuclear receptor compared to other nuclear receptors. The method is 
exemplified by modeling test compounds that fit spatially and preferentially into a nuclear 
receptor ligand binding domain of interest using an atomic structural model of a nuclear receptor 
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LBD, selecting a compound that interacts with one or more residues of the LBD unique in the 
context of that site, and identifying in an assay, for example a biological assay, for Hgand 
binding activity a compound that selectively binds to the LBD compared to other nuclear 
receptors. The unique features involved in receptor-selective ligand binding can be identified by 
5 comparing atomic models of different nuclear receptors or isoforms of the same type of receptor. 

The invention finds use in the selection and characterization of peptide, peptidomimetic, 
as well as other compounds, such as small organic molecules, identified by the methods of the 
invention, particularly new lead compounds useful in treating nuclear receptor-based disorders, 
in particular steroid receptor-based disorders, and more specifically estrogen receptor-based 
10 disorders. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 provides a stereo view of the electron density of the complexes, where Figure 
1 A is a stereo view of the electron density of the DES-ERa LBD-GRIP1 NR Box 11 peptide 
complex and Figure 1 B is a stereo view of the electron density of the OHT-ERct LBD complex. 
15 Figure 1 is a black and white graphical representation of a figure that was generated using 
BOBSCRIPT (Esnouf, J. MoL Graph . Model . 15, 132-4, 1 12-3 (1997)) and rendered using 
Raster3D (Merritt, et ah, Acta Crvstallogr . D 50:869-873 (1994)). 

Figure 2 was generated using BOBSCRIPT and rendered using Raster3D as described 
above. Figure 2A shows the overall structure of the DES-ERa LBD-GRIP1 NR Box II peptide 
20 complex in two orthogonal views. Figure 2B shows the overall structure of the OHT-ERct LBD 
complex in two orthogonal views similar to those of the agonist complex in Figure 2A. 

Figures 3 A and 3B were generated using BOBSCRIPT and rendered using Raster3D as 
described above. Figures 3C and 3D were created using GRASP (Nicholls. GRASP Manual 
(New York: Columbia University, 1992)). Figure 3 A shows a close-up view of the coactivator 
25 peptide bound to the LBD, i.e., the NR Box II peptide/LBD interface. The regions of the LBD 

that do not interact with the peptide have been omitted for clarity. Helices 3, 4 and 5 are labeled 
H3. H4 and H5 respectively. The side chains of receptor residues which interact with the peptide 
are depicted, except for Lys 362 (blue) and Glu 542 (red), the side chains are colored by atom 
type (carbon and sulfur atoms are colored green, oxygen atoms are colored red and nitrogen 
30 atoms are colored blue). Helix 1 2 is colored magenta. The peptide, colored gold, is depicted as 
a Cot worm; only the side chains of He 689 and the three motif leucines (Leu 690, Leu 693 and 
Leu 694) are drawn (Figure 3C). Figure 3B shows the helix 12/LBD interface as a close-up view 

7 



RNSnnr.in: <WO 995065SA2 I > 



WO 99/50658 



PCI7US99/06937 



of the OHT-LBD complex showing helix 12 bound to part of the coactivator binding site. Only 
the side chains of residues that interact with helix 12 are drawn (with the exception of side chain 
of His 373 which is omitted for clarity). Except for Lys 362 (blue) and Glu 380 (red), the side 
chains are colored by atom type as specified for Figure 3 A. Residues 530-551 are depicted as a 
5 Ca worm; residues 536-544 are colored magenta. The side chains of Leu 536, Tyr 537, Leu 

540, Met 543 and Leu 544 are shown. Figure 3C is a molecular surface representation showing 
the electrostatic surface of the ERct LBD bound to the NR Box II peptide as positive (blue) and 
negative (red) regions, as calculated in GRASP. The coactivator peptide is depicted as in Figure 
3 A and the view is equivalent to that in Figure 3 A. The side chains of Leu 690 and Leu 694 are 

10 bound in a hydrophobic groove and those of He 689 and Leu 693 rest against the edge of this 
groove. Figure 3D shows the electrostatic surface of the ERa LBD complexed with OHT, 
showing positive (blue) and negative (red) regions as in Figure 3C. Residues 530-51 1 are 
depicted as in Figure 3B and the view is equivalent to that in Figure 3B. Whereas the side chains 
of Leu 540 and Leu 544 are embedded in the hydrophobic groove, that of Met 543 lies along the 

1 5 edge of this groove. 

Figure 4 was generated using LIGPLOT (Wallace, et a!.. Protein Eng . 8:127-34 (1995)) 
and provides schematic diagrams illustrating the DES interactions with the LBD (Figure 4A) and 
OHT interactions with the ligand binding pocket (Figure 4B). Residues that interact with the 
ligands are drawn at approximately their true positions. The residues that form van der Waals 

20 contacts with ligand are depicted as labeled arcs with radial spokes that face towards the ligand 
atoms with which they interact. The residues that hydrogen bond to ligand are shown in ball- 
and-stick representation. Hydrogen bonds are represented as dashed cyan lines and the distance 
of each bond is given. The ligand rings and the individual ligand atoms are labeled. 

Figure 5 was generated using BOBSCRIPT and rendered using Raster3D as described 

25 above, and shows a comparison of helix 12 from the OHT complex and the NR Box II peptide. 
Figure 5A and Figure 5B are stereo views. The structures of the OHT-LBD complex and the 
DES-LBD-NR Box II peptide complex were overlapped using the Ca coordinates of residues 
306-526 of the LBD. Helix 12 from the DES-LBD-coactivator peptide complex is omitted for 
clarity. Residues 536-551 (helix 12=residues 536-544) from the OHT-LBD complex are colored 

30 magenta and the peptide is colored gold. The hydrogen bonds between the e-amino group of Lys 
362 and the backbone carbonyls of residues 543 and 544 of helix 12 are illustrated as dashed 
magenta lines. The hydrogen bonds between the e-amino group of Lys 362 and the backbone 

8 



QMcrvvirv *-\*jr\ 



WO 99/50658 



PCT/US99/06937 



carbonyls of residues 693 and 696 of the coactivator peptide are depicted as dashed orange lines. 
The following abbreviations are used on helix 12: L540=Leu 540. M543=Met 543, and 
L544=Leu 544. The following abbreviations are used on the peptide: L690=Leu 690, L693=Leu 
693 and L694=Leu 694. 

5 Figures 6A and 6D were generated using BOBSCRIPT and rendered using Raster3D as 

described above. Figures 6B and 6C were created using MidasPlus (Huang, et al.. J. Mol. Graph . 
9:230-6. 242 ( 1 99 1 )). Figure 6A shows that agonists and antagonists promote different LBD 
conformations, as ribbon representations of the DES complex (without the coactivator peptide), 
the OHT complex and the E 2 complex such as is described in Tanenbaum, et ah, supra . The 

10 hormones are shown in space-filling representation. In each complex, helix 12 is colored 

magenta and the main chain of residues 339 to 341, 421 to 423, and 527 to 530 is indicated in 
red. Helices 3, 8 and 1 1 (H3, H8 and HI 1 respectively) are labeled in the DES complex. Figure 
6B shows DES bound in the ligand binding cavity. A cross-section of a space-filling model of 
the LBD bound to DES (green) showing the ligand completely embedded in the ligand binding 

15 cavity. The A' ring of DES (A'), Phe 404 (404), Met 421 (421) and Phe 425 (425) are labeled. 
The carbon atoms of side chain of Met 421 are colored magenta, and the sulfur atom is colored 
yellow. Figure 6C is a cross-section of a space-filling model of OHT (red) bound in the ligand 
binding pocket. The view is equivalent to that in Figure 6B. The B ring of OHT (B), Phe 404 
(404), Met 421 (421) and Phe 425 (425) are labeled. The side chain of Met 421 is colored as in 

20 Figure 6B. The conformation of the B ring forces Met 421 to adopt a different conformation 
than in the one it adopts in the DES complex (compare with Figure 6B). Figure 6D provides a 
. comparison of the ligand binding pocket bound to DES (green) and to OHT (red). The structures 
of the OHT complex and the DES complex were overlapped as in Figure 5. The A rings of both 
ligands point out of the page; the B ring of OHT and the A' ring of DES point into the page. The 

25 LBD bound to OHT is colored blue and the LBD bound to DES is colored light blue-grey. The 
side chains of some of the residues whose conformations are dramatically different between the 
two complexes are drawn; Met 342 (342). Met 343 (343), Phe 404 (404), Met 421 (421), He 424 
(424), Phe 425 (425), His 524 (524), Leu 525 (525), Met 528 (528). The sulfur atom of Met 421 
is colored yellow in both structures. 

30 Figure 7 illustrates a model of antagonist action. Agonist (white triangle) binding 

stabilizes a conformation of the LBD that promotes coactivator (yellow) binding. Residues 527- 
530 (red) are part of helix 1 1 (blue) and the length of the interhelical loop prevents helix 12 
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(magenta ) from binding to the static region of the surface involved in transcriptional activity. 
Antagonist (white cross) side chains preclude helix 12 from being positioned over the ligand 
binding pocket. Residues 527-530 (red) adopt an extended conformation as a result of 
antagonist-driven structural perturbations in and around the ligand binding pocket. The length of 
5 the loop between helices 1 1 and 12 allows helix 12 to bind the static region of this surface and 
inhibit coactivator recognition. 

Figure 8 shows alignment of amino acid sequences (single letter amino acid designations) 
containing residues that form the coactivator binding sites of several nuclear receptors: human 
and recombinant thyroid hormones (hTRp and rTRa) (SEQ ID NO:5 and 6 and SEQ ID NO:7 

10 and 8), retinoids (hRARy and hRXRa) (SEQ ID NO:9 and 10 and SEQ ID NO:l 1 and 12), 

peroxisome (hPPARy) (SEQ ID NO: 13 and 14), vitamin D (hVDR) (SEQ ID NO: 15 and 16), 
estrogen (hERct) (SEQ ID NO: 17 and 18), glucocorticoid (hGR) (SEQ ID NO: 19 and 20), 
progestin (hPR) (SEQ ID NO:21 and 22), mineralocorticoid (hMR) (SEQ ID NO:23 and 24) and 
androgen (hAR) (SEQ ID NO:25 and 26). The boxes represent residues of alpha-helix (H3, H4, 

15 H5, H6 and HI 2); lower case letters "h" and "q" represent hydrophobic and polar residues, 

respectively. The numbered positions at the top of the table, 280. 281, etc., correspond to hTRP 
residues. 

DESCRIPTION OF SPECIFIC EMBODIMENTS 
The present invention provides methods and compositions for identifying compounds 
20 that modulate nuclear receptor activity, in particular steroid receptor activity, and more 
particularly estrogen receptor activity. The compounds are nuclear receptor agonists or 
antagonists that bind to the ligand binding domain. Compounds that bind to the LBD also are 
provided. The compounds can be natural or synthetic. Preferred compounds are small organic 
molecules, peptides and peptidomimetics (e.g., cyclic peptides, peptide analogs, or constrained 
25 peptides). 

Certain residues within the ERa LBD have been identified that are of particular 
importance:Met343, Leu346, Ala350, Glu353. Leu384, Leu387, Leu391, Arg394. Phe404, 
Met421. Leu428. Gly521, His524, Leu525 and Met528 (See Figure 4 A). Of these, some have 
been found to directly or indirectly effect the positioning of helix 12: Met343. Met421, His524, 
30 Leu525 and Met528. Interactions with these particular residues, such as occurs when DES binds 
to the receptor stabilizes a conformation of the LBD that promotes coactivator binding. 
Modifications to a ligand that enhance binding or interaction with these residues would provide 
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for an improved agonist of receptor activity. Similarly, modifications to a ligand that adversely 
affects the binding or interaction with these residues would provide for an improved antagonist. 

In addition, it is believed that the ERa Tyr537 residue plays a role in stabilizing the 
unliganded receptor so that helix 12 is free to interact with the coactivator binding site. The ER 
5 is quite unique in having a tyrosine at this position as hTRp, rTRcx, hRARy, hGR. hPR, hMR and 
hAR all have a proline residue. hRXRa has as aspartic acid residue, hPPARy has a histidine 
residue and hVDR has a threonine residue at positions corresponding to the Tyr 537 residue of 
hERa. Therefore, selective agonists and antagonists can be designed for the estrogen receptor 
that interact with Tyr 537. 

10 Traditional antagonists such as OHT or RAL have side chains that directly block helix 1 2 

positioning as it is in the agonist complexes (DES or E?). In addition, it is expected that ligand- 
induced perturbations are required to allow helix 12 to reach into the coactivator helix binding 
site, thereby blocking coactivator binding and inhibiting transcriptional activity. Such 
perturbations or interference with proper interactions with one or more of these LBD residues, 

1 5 has been found to relax the receptor, disassembling the receptor's secondary structure and 

resulting in unwinding of helix 1 1 . The unwinding of helix 1 1 increases the length of the loop 
between helices 1 1 and 12, allowing helix 12 to move away from the ligand binding pocket and 
towards the coactivator binding site, where it occludes the coactivator recognition groove by 
mimicking the interactions of the coactivator, and thus inhibits coactivator recognition (see 

20 Figure 7). Modifications to a ligand that interfere with binding or interaction with one or more 
of the amino acids positions indicated, would cause receptor relaxation, affecting the receptor's 
secondary structure and cause the unwinding of helix 12. Compounds based upon such modified 
ligands would act as antagonists. 

Accordingly, one aspect of the invention is a method of identifying a compound that 

25 modulates (i.e., increases or decreases) nuclear receptor activity, comprising: modeling test 

compounds that fit spatially into a nuclear receptor ligand binding domain of interest using an 
atomic structural model of the estrogen receptor a ligand binding domain or portion thereof, 
screening the test compounds in an assay, for example a biological assay, characterized by 
binding of a test compound to the ligand binding domain, and identifying a test compound that 

30 modulates nuclear receptor activity, wherein the atomic structural model comprises atomic 
coordinates of amino acid residues corresponding to residues of human estrogen receptor a 
Met343, Leu346, Ala350, Glu353. Leu384, Leu387, Leu391 , Arg394, Phe404, Met421, Leu428. 

1 1 
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GW521. His524. Leu525 and Met528. preferably Met343, Met42L His524. Leu525 and Met528. 
In a preferred embodiment, the nuclear receptor is the ER. The test compound can be an agonist 
and nuclear receptor activity is measured by binding of a coactivator or a compound that mimics 
a coactivator, to the coactivator binding site, as defined below. On the other hand, the test 
5 compound can be an antagonist and nuclear receptor activity is measured by the unwinding of 
helix 12 and/or the blocking of coactivator binding to the coactivator binding site. The screening 
is typically in v/7ro, and high throughput screening is preferable. Suitable test compounds can be 
designed, as is described later, or can be obtained from a library of compounds, and include, by 
means of illustration and not limitation, small organic molecules, peptides and peptidomimetics. 

1 0 The method described above may also include the step of providing the atomic coordinates of the 
estrogen receptor a ligand binding domain or portion thereof to a computerized modeling 
system, prior to said modeling step. 

As used herein, the term "portion thereof 1 is intended to mean the atomic coordinates 
corresponding to a sufficient number of residues or their atoms of the LBD that interact with a 

1 5 compound capable of binding to the site. This includes receptor residues having an atom within 
4.5A of a bound compound or fragment thereof. Thus, for example, the atomic coordinates 
provided to the modeling system can contain atoms of the nuclear receptor LBD, part of the LBD 
such as atoms corresponding to the LBD or a subset of atoms useful in the modeling and design 
of compounds that bind to a LBD. 

20 The atomic coordinates of a compound that fits into the ligand binding domain also can 

be used for modeling to identify compounds or fragments that bind the site. By "modeling" is 
intended quantitative and qualitative analysis of molecular structure/function based on atomic 
structural information and receptor-ligand agonists/antagonists interaction models. This includes 
conventional numeric-based molecular dynamic and energy minimization models, interactive 

25 computer graphic models, modified molecular mechanics models, distance geometry and other 
structure-based constraint models. Modeling is preferably performed using a computer and may 
be further optimized using known methods. By 'Tits spatially" is intended that the three- 
dimensional structure of a compound is accommodated geometrically by a cavity or pocket of a 
nuclear receptor LBD. 

30 It is expected that targeting the corresponding amino acids on other nuclear receptors will 

have the same effect. Accordingly, one embodiment of the invention pertains to methods of 
designing antagonists that bind the LBD of a nuclear receptor but do not interact with one of 
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more residues within the LBD that correspond to (i.e., the same as or equivalent to) human ERa 
residues Met343. Leu346, Ala350, Glu353, Leu384. Leu387. Leu391, Arg394, Phe404. Met421. 
Leu428, Gly52 1. His524. Leu525 and Met528, preferably Met343, Met42K His524. Leu525 and 
Met528. Similarly, another embodiment of the invention pertains to methods of designing 
5 agonists that bind the LBD of a nuclear receptor and have enhanced interaction with one or more 
residues within the LBD that correspond to the human ERa residues Met343, Leu346 ? Ala350, 
Glu353, Leu384. Leu387, Leu39L Arg394, Phe404. Met421, Leu428, Gly521, His524, Leu525 
and Met528, preferably Met343, Met421, His524, Leu525 and Met528. An example of 
enhanced interaction is where the agonist has a greater binding affinity for one or more of said 

10 residues, as compared to an endogenous ligand. Such corresponding positions for other 

members of the nuclear receptor family are shown in Table 1 . which provides an alignment of 
amino acid sequences (one letter amino acid designations) containing residues from the ligand 
binding domains of several nuclear receptors that correspond to the designated positions on the 
human estrogen receptor (hERa): recombinant thyroid hormone (rTRct), retinoids (hRARy and 

1 5 hRXRct), glucocorticoid (hGRct), progestin (hPR), rnineralocorticoid (hMR) and androgen 

(hARa). It is understood that Table 1 is merely illustrative of the invention and is not intended 
to be limiting in any manner. Accordingly, it is understood that corresponding amino acid 
residues of other nuclear receptors such as other estrogen receptors, thyroid receptors, retinoid 
receptors, glucocorticoid receptors, progestin receptors, rnineralocorticoid receptors, androgen 

20 receptors, peroxisome receptors and vitamin D receptors, may also be used in the methods of the 
invention. 

Table 1 



hERa 


M343 


M421 


F425 


H524 


L525 


M528 


hPR 


L715 


F794 


1798 


Y890 


C891 


T894 


hARa 


L701 


M780 


Q784 


F876 


T877 


L881 


hGRa 


M560 


M639 


Q643 


Y735 


C736 


T740 


hMR 


L766 


M845 


L849 


F941 


C942 


T946 


hRARy 


W227 


F304 


L308 


R396 


A397 


L401 


hRXRa 


V265 


G343 


F346 


H435 


L436 


F440 


rTRa 


F215 


L292 


V296 


R384 


F385 


M389 



The term "coactivator binding site" is used herein to mean a structural segment or 
segments of the nuclear receptor polypeptide chain folded in such a way so as to give the proper 
geometry and amino acid residue conformation for binding coactivator. This is the physical 
arrangement of protein atoms in three-dimensional space forming a coactivator binding site 
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pocket or cavity. As described by Apriietti. et aL supra , residues forming the coactivator 
binding site on nuclear receptors are amino acids that correspond to (i.e.. the same as or 
equivalent to) human TR residues of C-terminal helix 3 (Ile280. Thr281. Val283. Val284, 
Ala287. and Lys288), helix 4 (Phe293). helix 5 (Gln301, IIe302. Leu305. Lys306), helix 6 
(Cys309). and helix 12 (Pro453. Leu454. Glu457, Val458 and Phe459). as shown in Figure 8. 
The coactivator binding site is highly conserved among the nuclear receptor super family. Thus, 
this site corresponds to a surprisingly small cluster of residues on the surface of the LBD that 
form a prominent hydrophobic cleft. The hydrophobic cleft is formed by hydrophobic residues 
corresponding to human TR residues of C-terminal helix 3 (Ile280. VaI283. Val284, and 
Ala287), helix 4 (Phe293), helix 5 (Ile302 and Leu305), helix 6 (Cys309), and helix 12 (Leu454, 
Val458 and Phe459). This hydrophobic cleft of the coactivator binding site is also highly 
conserved among the nuclear receptor super family. 

Based upon the Examples set forth herein, residues forming the coactivator binding site 
on the estrogen receptor were found to correspond to those positions described above for the 
human TR. Accordingly, the residues forming the coactivator binding site on ERct are the 
human ERct residues of C-terminal helix 3 (Leu354, Val355, Met357, Ile358. Ala361, and 
Lys362), helix 4 (Phe367), helix 5 (Gln375, Val376, Leu379, Glu380). helix 6 (Trp383), and 
helix 12 (Asp538, Leu539, Glu542, Met543 and Leu544), as shown in Figure 8. As noted above 
for the nuclear receptor family in general, this site corresponds to residues on the surface of the 
LBD that form a prominent hydrophobic cleft, formed by hydrophobic residues corresponding to 
human ERct residues of C-terminal helix 3 (Leu354, Met357, Ile358 and Aia361 ), helix 4 
(Phe367). helix 5 (Val376, Leu379). helix 6 (Trp383), and helix 12 (Leu539. Met543 and 
Leu544). This corroborates the data presented by Apriietti, et al., supra , for the nuclear receptor 
family. 

Structural analysis has revealed the mechanisms by which tamoxifen and other SERMs 
bind the ligand binding domain and block coactivator binding and hence transcriptional activity. 
By this, an understanding of ligand and coactivator binding has also been achieved. Therefore, 
the coactivator binding site residues described above are useful in designing coactivator mimics 
that have broad application in the methods of the instant invention. Such "coactivator mimics" 
are peptides or polypeptides that mimic the coactivator binding site recognition area on the 
surface of a coactivator such that a "coactivator mimic" acts as a competitive inhibitor of 
coactivator binding to the coactivator binding site. Coactivator mimics can be used in an assay 
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to determine receptor activity and hence the agonist or antagonist nature of a test compound, in 
that an agonist will permit a coactivator mimic to bind to the coactivator binding site, while an 
antagonist will prevent such binding. In addition, such coactivator mimics may have therapeutic 
utility when administered in combination with an agonist compound of the invention. 
5 Another embodiment of the invention pertains to a method of identifying a compound 

that modulates ligand binding to a nuclear receptor, typically by binding to the ligand binding 
domain. This method comprises the steps of modeling test compounds that fit spatially into a 
nuclear receptor ligand binding domain of interest using an atomic structural model of the 
estrogen receptor a ligand binding domain or portion thereof, screening the test compounds in an 

10 assay characterized by binding of a test compound to the binding domain, and identifying a test 
compound that modulates ligand binding to said nuclear receptor, wherein said atomic structural 
model comprises atomic coordinates of amino acid residues corresponding to residues of human 
estrogen receptor a Met343, Leu346, Ala350, Glu353. Leu384, Leu387, Leu391, Arg394, 
Phe404, Met421, Leu428, GIy521, His524, Leu525 and Met528, preferably Met343, Met421, 

1 5 His524, Leu525 and Met528. In the preferred method, the nuclear receptor is ER, TR, GR or 
PR. The screening is typically in vitro such as by high throughput screening. Suitable test 
compounds can be designed or obtained from a library of compounds, and include, by means of 
illustration and not limitation, small organic molecules, peptides and peptidomimetics. The test 
compounds can be either agonists or antagonists of ligand binding. 

20 The invention also includes compositions and methods for identifying key residues 

within the ligand binding domains of nuclear receptors. The methods involve examining the 
surface of a nuclear receptor of interest to identify residues that modulate ligand and/or 
coactivator binding. The residues can be identified by homology to the key residues on the LBD 
of human ERct described herein. A preferred method is alignment with the residues of any 

25 nuclear receptor corresponding to (i.e., equivalent to) human ERa residues of Met343, Leu346, 
Ala350, Glu353, Leu384, Leu387, Leu39h Arg394, Phe404, Met421, Leu428. Gly52L His524, 
Leu525 and Met528, preferably Met343, Met42L His524, Leu525 and Met528. Overlays and 
superpositioning with a three-dimensional model of a nuclear receptor LBD, or a portion thereof 
that contains these or corresponding residues, also can be used for this purpose. For example, 

30 three-dimensional structures of TR, GR arid PR LBDs can be used for this purpose. For 

example, nuclear receptors identifiable by homology alignment include normal nuclear receptors 
or proteins structurally related to nuclear receptors found in humans, natural mutants of nuclear 
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receptors found in humans, normal or mutant receptors found in animals, as well as non- 
mammalian organisms such as pests or infectious organisms, or viruses. 

Alignment and/or modeling also can be used as a guide for the placement of mutations on 
the LBD surface to characterize the nature of the site in the context of a cell. Selected residues 
5 are mutated to preserve global receptor structure and solubility in the case of an agonist, or to 
disassemble such structure and permit helix 1 2 to unwind, as is the case with an antagonist. 
Mutants can be tested for ligand binding as well as the relative change in strength of the binding 
interaction. Ligand-dependent coactivator interaction assays also can be tested for this purpose, 
such as those described herein. 

1 0 In particular, the present invention relates to the structural and functional effects on the 

estrogen receptor's LBD. of the binding of two chemically-related compounds, the agonist, 
diethylstilbestrol (DES). and the selective antagonist 4-hydroxytamoxifen (OHT), the active 
metabolite of tamoxifen. As described in the Examples, mutagenesis and binding studies, 
coupled with analysis of atomic models derived from cocrystals, reveals the structure of the 

15 human estrogen receptor a ligand binding domain (ERa LBD) co-crystallized with a peptide 
molecule comprising a GRIP1 NR Box II peptide sequence (SEQ ID NO:4) (i.e., a peptide 
derived from the NR Box II region of the pi 60 coactivator GRIP1) bound to the coactivator 
binding site and the agonist. DES. Also revealed is the structure of the ERa LBD co-crystallized 
with the antagonist, OHT. The Examples provide the 2.03 A resolution crystal structure of the 

20 hERa LBD bound to DES and the coactivator and the 1 .9A x-ray crystal structure of the hERa 
LBD bound to OHT, i.e.. the crystals defract with at least 2.03 A or 1 .9A resolution, respectively. 

In yet another aspect of the invention, compounds of interest are discovered, i.e., agonists 
or antagonists of ligand binding are identified, by a method for identifying an agonist or 
antagonist of ligand binding to a nuclear receptor. The method comprises the steps of providing 

25 the atomic coordinates of the ERa LBD or portion thereof to a computerized modeling system, 
modeling compounds which fit spatially into the LBD, and identifying in an assay for nuclear 
receptor activity a compound which increases or decreases the activity of the nuclear receptor by 
binding the LBD of the nuclear receptor. Preferably, the atomic coordinates are of the amino 
acid residues corresponding to residues of human estrogen receptor a Met343. Leu346, Ala350. 

30 Glu353, Leu384. Leu387. Leu391. Arg394. Phe404, Met421, Leu428. Gly521, His524, Leu525 
and Met528, preferably Met343. Met421, His524. Leu525 and Met528. 
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Compounds of particular interest fit spatially and preferentially into the ligand binding 
domain. By "its spatially and preferentially" is intended that a compound possesses a three- 
dimensional structure and conformation for selectively interacting with a nuclear receptor LBD. 
Compounds that fit spatially and preferentially into the LBD interact with amino acid residues 
5 forming the hydrophobic cleft of this site. The present invention also includes a method for 

identifying a compound capable of selectively modulating nuclear receptor activity. The method 
comprises the steps of modeling test compounds that fit spatially and preferentially into the LBD 
of a nuclear receptor of interest using an atomic structural model of a nuclear receptor, screening 
the test compounds in an assay for nuclear receptor activity characterized by preferential binding 

10 of a test compound to the LBD of a nuclear receptor, and identifying a test compound that 

selectively modulates the activity of a nuclear receptor. Such receptor-specific compounds are 
selected that exploit differences between the LBDs of one type of nuclear receptor versus a 
second type of nuclear receptor. 

The invention also is applicable to generating new compounds that distinguish nuclear 

15 receptor isoforms. This can facilitate generation of either tissue-specific or function-specific 
compounds. For instance, GR subfamily members have usually one receptor encoded by a 
single gene, although there are exceptions. For example, there are two PR isoforms, A and B, 
translated from the same mRNA by alternate initiation from different AUG codons. There are 
two GR forms, one of which does not bind ligand. This method is especially applicable to the 

20 ER subfamily which usually has several receptors that are encoded by at least two (ER: a, p) 
genes or have alternate RNA splicing. 

The receptor-specific compounds of the invention preferably interact with 
conformationally constrained residues of the LBD that are conserved among one type of nuclear 
receptor compared to a second type of nuclear receptor. "Conformationally constrained" is 

25 intended to refer to the three-dimensional structure of a chemical or moiety thereof having 
certain rotations about its bonds fixed by various local geometric and physical-chemical 
constraints. Conformationally constrained structural features of a LBD include residues that 
have their natural flexible conformations fixed by various geometric and physical-chemical 
constraints, such as local backbone, local side chain, and topological constraints. These types of 

30 constraints are exploited to restrict positioning of atoms involved in receptor-coactivator 
recognition and binding. 
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The present invention also provides for a computational method using three dimensional 
models of nuclear receptors based on crystals of nuclear receptors. Generally, the computational 
method of designing a nuclear receptor ligand determines which amino acid or amino acids of a 
nuclear receptor LBD interact with a chemical moiety (at least one) of the ligand using a three 
5 dimensional model of a crystallized protein comprising a nuclear receptor LBD with a bound 
ligand, and selecting a chemical modification (at least one) of the chemical moiety to produce a 
second chemical moiety with a structure that either decreases or increases an interaction between 
the interacting amino acid and the second chemical moiety compared to the interaction between 
the interacting amino acid and the chemical moiety. In the instant invention, crystal structures of 

10 the hERct with DES/pcptide and with OHT, have shown that amino acid residues that correspond 
to hERa Met343. Leu346, Ala350 ? Glu353, Leu384. Leu387, Leu391, Arg394, Phe404. Met421, 
Leu428, Gly521, His524. Leu525 and Met528, preferably Met343, Met42U His524. Leu525 
and/or Met528. interact with at least one chemical moiety on the ligand. 

Accordingly, one embodiment of the invention is a computational method of designing a 

15 nuclear receptor ligand where at least one amino acid residue of a nuclear receptor LBD that 

corresponds to human estrogen receptor a Met343, Leu346, Ala350, Glu353, Leu384, Leu387, 
Leu39U Arg394, Phe404, Met421, Leu428, Gly521. His524, Leu525 and Met528, preferably 
Met343, Met42L His524, Leu525 and Met528, interacts with at least one first chemical moiety 
of the ligand. comprising the step of selecting at least one chemical modification of the first 

20 chemical moiety to produce a second chemical moiety with a structure to either decrease or 
increase an interaction between the interacting amino acid and the second chemical moiety as 
compared to the interaction between the interacting amino acid and the first chemical moiety. 

This computational method may further comprise determining a change in interaction 
between the interacting amino acid and the ligand after chemical modification of the first 

25 chemical moiety. The chemical modification can either enhance or reduce hydrogen bonding 
interaction, charge interaction, hydrophobic interaction. Van Der Waals interaction or dipole 
interaction between the second chemical moiety and the interacting amino acid as compared to 
the interaction between the first chemical moiety and the interacting amino acid. 

Chemical modifications will often enhance or reduce interactions an atom of a LBD 

30 amino acid and an atom of an LBD ligand. Steric hindrance will be a common means of 
changing the interaction of the LBD binding cavity with the activation domain. Chemical 
modifications are preferably introduced at C-H, C- and C-OH position in ligands. where the 
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carbon is part of the ligand structure which remains the same after modification is complete. In 
the case of C-H, C could have 1. 2 or 3 hydrogens, but usually only one hydrogen will be 
replace. The H or OH are removed after modification is complete and replaced with the desired 
chemical moiety. 

5 Such chemical modifications would preferably involve the addition of substituents. onto 

any of the free carbons of the A' ring of DES positioning these substituents to collide or bind 
preferentially with one or more residues that correspond to hERa Met343. Lcu346. Ala350, 
Glu353, Leu384, Leu387, Leu391 . Arg394, Phe404, Met421, Leu428, Gly52L His524. Leu525 
and Met528, preferably Met343. Met421. His524, Leu525 or Met528, Typical substituents are 

10 hydrophobic groups, including by way of example and not limitation, alkyl groups such as ethyl, 
propyl, isopropyl, etc., and aromatic groups such as benzyl, etc. 

In practice, one would start with a known ligand for the nuclear receptor of interest as the 
chemical backbone, upon which to base agonist/antagonist design. The known ligand would be 
modified as described above. For example 1 7p-estradiol is an endogenous ligand for the hERa. 

15 In the case of estradiol, positions of interest are C6a, C7a, CI 2a, CI 5a, CI 6a and CI 7a. 

Modifications at one or more of these free carbons on 17p-estradiol's backbone would affect the 
ligand's interactions with one or more of the Met343, Leu346, Ala350, Glu353. Leu384, Leu387, 
Leu39L Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528, preferably 
Met343, Met421, His524, Leu525 and Met528 residues, either providing for enhancing 

20 interaction, which would be the basis for agonist design, or reduced interaction, which would be 
the basis for antagonist design. 

- Other chemical backbones of other known ligands could be used in a similar manner. 
For example, other known agonists include diethylstilbestrol (synthetic), moxestrol (synthetic), 
mesohexestrol (synthetic), coumestrol (clover), A 9 -THC (cannabis), o,p-DDT (insecticide), 

25 zearalenone (fungal) and kepone (insecticide). Known estrogen receptor antagonists include the 
ICI series of modified steroids such as ICI 164,384 and EM800. Known SERM's include 
tamoxifen, raloxifene and GW5638. 

Alternatively known agonists could be positioned in the ligand binding pocket through 
computational or manual docking. Positions for substitution would then be selected based on the 

30 predicted binding orientation of these compounds. In addition, hybrid molecules could be 
generated that also possessed side chains that prevented helix 12 from adopting the agonist- 
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bound position. Novel SERMs can be produced by varying the strength of two different effects: 
the helix 1 2 displacement and the secondary structure disorganization. 

Previous efforts to make antagonists have involved attachment of large bulky substituents 
to agonists, typically through trial and error, and the drug design methods described herein 
provide an alternative strategy of ligand design that may be critical for developing new potential 
antagonists. 

For modeling, docking algorithms and computer programs that employ them can be used 
to identify compounds that fit into the ligand binding domain. For example, docking programs 
can be used to predict how a molecule of interest can interact with the nuclear receptor LBD. 
Fragment-based docking also can be used in building molecules de novo inside the LBD, by 
placing chemical fragments that complement the site to optimize intermolecular interactions. 
The techniques can be used to optimize the geometry of the binding interactions. This design 
approach has been made possible by identification of the LBD structure thus, the principles of 
molecular recognition can now be used to design a compound which is complementary to the 
structure of this site. Compounds fitting the LBD serve as a starting point for an iterative design, 
synthesis and test cycle in which new compounds are selected and optimized for desired 
properties including affinity, efficacy, and selectivity. For example, the compounds can be 
subjected to addition modification, such as replacement and/or addition of R-group substituents 
of a core structure identified for a particular class of binding compounds, modeling and/or 
activity screening if desired, and then subjected to additional rounds of testing. 

Computationally small molecule databases can be screened for chemical entities or 
compounds that can bind in whole, or in part, to a nuclear receptor ligand binding domain of 
interest. In this screening, the quality of fit of such entities or compounds to the binding site may 
be judged either by shape complementarity (DesJalais, et al., J. Med . Chem . 31:722-729 (1988)) 
or by estimated interaction energy (Meng, et al., J. Comp . Chem . 13:505-524 (1992)). The 
molecule databases include any virtual or physical database, such as electronic and physical 
compound library databases, and are preferably used in developing compounds that modulate 
coactivator binding. 

Compounds can be designed intelligently by exploiting available structural and 
functional information by gaining an understanding of the quantitative structure-activity 
relationship (QSAR), using that understanding to design new compound libraries, particularly 
focused libraries having chemical diversity of one or more particular groups of a core structure. 
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and incorporating any structural data into that iterative design process. For example, one skilled 
in the art may use one of several methods to screen chemical entities or fragments for their 
ability to associate with the ligand binding domain of a nuclear receptor of interest. This process 
may begin by visual inspection of. for example, the LBD on the computer screen. Selected 
5 fragments or chemical entities may then be positioned into all or part of the site. Docking may 
be accomplished using software such as Quanta and Sybyl, followed by energy minimization and 
molecular dynamics with standard molecular mechanics force-fields, such as CHARMM and 
AMBER. 

Specialized computer programs may also assist in the process of selecting chemical entity 
10 fragments or whole compounds. These include: GRID (Goodford. J. Med . Chem . 28:849-857 
(1985), available from Oxford University, Oxford, UK); MCSS (Miranker. et aL "Proteins: 
Structure, Function and Genetics" 1 1 :29-34 (1991), available from Molecular Simulations, 
Burlington. MA); AUTODOCK (Goodsell, et al., "Proteins: Structure, Function and Genetics" 
8:195-202 (1990), available from Scripps Research Institute. La Jolla, CA); and DOCK (Kuntz, 
1 5 et al, J. Mol . Biol . 1 6 1 :269-288 ( 1 982), available from University of California, San Francisco, 
CA). 

Additional commercially available computer databases for small molecular compounds 
include Cambridge Structural Database and Fine Chemical Database (Rusinko, Chem . Des. 
Auto . News 8:44-47 (1993)). 

20 Once suitable chemical entities or fragments have been selected, they can be assembled 

into a single compound. Assembly may be proceeded by visual inspection of the relationship of 
the fragments to each other on the three-dimensional image displayed on a computer screen in 
relation to the structure coordinates of a nuclear receptor. This can be followed by manual 
model building using software such as Quanta or Sybyl. 

25 Useful programs to aid one of skill in the art in connecting the individual chemical 

entities or fragments include: CAVEAT (Bartlett, et al., "CAVEAT: A Program to Facilitate the 
Structure-Derived Design of Biologically Active Molecules", in Molecular Recognition in 
Chemical and Biological Problems, Special Pub., Royal Chem. Soc. 78:182-196 (1989), 
available from the University of California. Berkeley, CA); 3D Database systems such as 

30 MACCS-3D (MDL Information Systems. San Leandro. CA. reviewed in Martin. J. Med. Chem. 
35:2145-2154 (1992)); and HOOK (available from Molecular Simulations. Burlington. MA). 

21 
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In addition to building a compound in a step-wise fashion, one fragment or chemical 
entity at a time as described above, compounds that bind to a ligand binding domain of interest 
also may be designed as a whole or de novo using either an empty LBD or optionally including 
some ponion(s) of a molecule known to bind to the site, such as a known ligand. These methods 
5 include: LUDI (Bohm. J. Comp . Aid . Molec . Design 6:61-78 (1992), available from Biosm 
Technologies, San Diego, CA); LEGEND (Nishibata. et al., Tetrahedron 47:8985 (1991), 
available from Molecular Simulations, Burlington, MA); and LeapFrog (available from Tripos 
Associates, St. Louis, MO). 

Other molecular modeling techniques may also be employed in accordance with this 

1 0 invention. See, for example, Cohen, et al., J. Med . Chem . 33:883-894 ( 1 990) and Navia, et al., 
Current Opinions in Structural Biology 2:202-210 (1992). For example, where the structures of 
test compounds are known, a model of the test compound may be superimposed over the model 
of the structure of the invention. Numerous methods and techniques are known in the art for 
performing this step, any of which may be used. See, for example, Farmer, "Drug Design' 1 

15 10:1 19-143, Ariens, ed., Academic Press, New York (1980); U.S. Patent No. 5,331,573; U.S. 
Patent No. 5,500,807; Verlinde, Structure 2:577-587 (1994); and Kuntz, et al., Science 
257:1078-1082 (1992). The model building techniques and computer evaluation systems 
described herein are not a limitation on the present invention. 

Using these computer modeling systems a large number of compounds may be quickly 

20 and easily examined and expensive and lengthy biochemical testing avoided. Moreover, the 
need for actual synthesis of many compounds can be substantially reduced and/or effectively 
eliminated. 

Compounds identified through modeling can be screened in assays such as are well 
known in the art. Such assays, which include biological assays, are characterized by binding of 

25 the compound to a ligand binding domain of interest for ligand binding activity. Screening can 
be, for example, in vitro, in cell culture, and/or in vivo. Biological screening preferably centers 
on activity-based response models, binding assays (which measure how well a compound binds 
to the receptor), and bacterial, yeast and animal cell lines (which measure the biological effect of 
a compound in a cell). The assays can be automated for high capacity-high throughput screening 

30 (HTS) in which large numbers of compounds can be tested to identify compounds with the 
desired activity. 
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As an example, in vifro binding assays can be performed in which compounds are tested 
for their ability to block the binding of a ligand. fragment, fusion or peptide thereof, to a ligand 
binding domain of interest. For cell and tissue culture assays, they may be performed to assess a 
compound's ability to block function of cellular coactivators. such as members of the pi 60 
5 family of coactivator proteins, such as SRC-1. AIB1, RAC3. p/CIP, and GRIP1 and its 

homologues TIF 2 and NcoA-2, and those that exhibit receptor and/or isoform-specific binding 
affinity. In a preferred embodiment, compounds of the invention bind to a ligand binding 
domain with greater affinity than the endogenous ligands. Tissue profiling and appropriate 
animal models also can be used to select compounds. Different cell types and tissues also can be 

10 used for these biological screening assays. Suitable assays for such screening are described 

herein and in Shibata, et al.. Recent Prog . Horm . Res. 52:141-164 (1997); Tagami, et al., Mol . 
Cell . Biol . 17(5):2642-2648 (1997); Zhu, et al., J. Biol. Chem. 272(1 4):9048-9054 (1997); Lin, 
et al., Mol. CeH. BioL 1 7(10):61 3 1-6138 (1997); Kakizawa, et al., J. Biol . Chem . 272(38):23799- 
23804 (1997); and Chang, et al., Proc . Natl . Acad . Sci. USA 94(17):9040-9045 (1997), which 

1 5 references are incorporated herein in their entirety by reference. 

The compounds selected can have agonist and/or antagonistic properties. The 
compounds also include those that exhibit new properties with varying mixtures of agonist and 
antagonist activities, depending on the effects of altering ligand binding in the context of 
different activities of nuclear receptors, either hormone-dependent or hormone-independent, 

20 which are mediated by proteins other than coactivators, and which interact with the receptors at 
locations other than the coactivator binding site. The compounds also include those, which 
through their binding to receptor locations that are conformationally sensitive to hormone 
binding, have allosteric effects on the receptor by stabilizing or destabilizing the hormone-bound 
conformation of the receptor, or by directly inducing the same, similar, or different 

25 conformational changes induced in the receptor by the binding of hormone. 

Of particular interest is use of such compounds in a method of modulating nuclear 
receptor activity in a mammal by administering to a mammal in need thereof a sufficient amount 
of a compound that fits spatially and preferentially into a ligand binding domain of a nuclear 
receptor of interest. By "modulating" is intended increasing or decreasing activity of a nuclear 

30 receptor. For example, pre-clinical candidate compounds can be tested in appropriate animal 
models in order to measure efficacy, absorption, pharmacokinetics and toxicity following 
standard techniques known in the art. Compounds exhibiting desired properties are then tested 
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in clinical trials for use in treatment of various nuclear receptor-based disorders. These include 
ER-based disorders, such as postmenopausal symptoms and cancer resulting from loss of 
estrogen production, and osteoporosis and cardiovascular disease stemming from traditional 
estrogen replacement therapy. Others include GR-based disorders including Type II diabetes 
and inflammatory conditions such as rheumatic diseases. 

Although for many nuclear receptors, the goal is to discover novel synthetic agonists or 
antagonists, it is important to realize the value for some nuclear receptors, especially the estrogen 
receptor, of developing compounds that have the desired agonist and antagonist effects in target 
tissues. Such compounds can be discovered and/or designed by the methods described herein, 
then screened for tissue specificity by methods that are well known in the art. For example, there 
is a great need for the improvement of existing therapies and the development of new agonists 
that act like estrogen in cardiovascular and brain tissue and bone, and new antagonists that act 
upon the estrogen receptor in uterine and breast tissue. Ideally, a compound will have more than 
one of these traits, i.e., a compound will act as an agonist in one tissue, while acting as an 
antagonist in another tissue. While the tissue-selective antagonism of SERMs such as OHT and 
RAL is the result of numerous factors (Grainger, et al.. Nature Medicine 2(4):38 1-385 (1996); 
Grese, et al., supra : and Jordan, J. Natl . Cancer Inst . 90:967-71 (1998)), dissection of the 
mechanisms of action of these ligands requires a comprehensive understanding of how they act 
on the LBD and regulate its interactions with other cellular factors. The instant invention shows, 
unexpectedly, that ligand-mediated structural perturbations in and around the ligand binding 
pocket, and not simply side chain effects, contribute to receptor antagonism. Accordingly, by 
adjusting the balance between these two effects provides a novel strategy for the design of 
improved SERMs. 

With this knowledge, it is of particular interest to design therapeutic compounds that will 
distort at least one amino acid residue corresponding to residues of human estrogen receptor a 
Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394. Phe404, Met421, Leu428, 
Gly521, His524, Leu525 and Met528, preferably Met343, Met421, His524, Leu525 and Met528. 
Accordingly, one aspect of the invention is a method of modulating nuclear receptor activity in a 
mammal by administering to a mammal in need thereof a sufficient amount of a ligand that fits 
spatially and preferentially into a ligand binding domain of a nuclear receptor of interest, 
wherein the ligand is designed by a computational method where at least one amino acid residue 
of a nuclear receptor LBD that corresponds to hERa Met343, Leu346. Ala350. Glu353, Leu384, 
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Leu387. Leu391 . Arg394. Phe404. Met421, Leu428. GIy52K His524, Leu525 and Met528. 
preferably Met343. Met421. His524, Leu525 and Met528. interacts with at least one first 
chemical moiety of the ligand. Such a method involves selecting at least one chemical 
modification of the first chemical moiety to produce a second chemical moiety with a structure 
5 that either decreases or increases an interaction between the interacting amino acid and the 

second chemical moiety as compared to the interaction between the interacting amino acid and 
the first chemical moiety. 

Compounds designed by this method can be either agonists or antagonists and the method 
of modulating nuclear receptor activity can comprise administering an antagonist alone, an 

1 0 agonist alone or an agonist in combination with a coactivator or a compound that mimics a 
coactivator by binding to the coactivator binding site. 

The coactivator can be a known coactivator. The coactivator mimic can be designed by a 
computational method where at least one amino acid residue of a nuclear receptor coactivator 
binding site that corresponds to hERa helix 3 residues Leu354, Val355, Met357, Ile358, Ala361 

1 5 and Lys362, helix 4 residue Phe367, helix 5 residues Gln375, Val376, Leu379 and Glu380, helix 
6 residue Trp383. and helix 12 residues Asp538, Leu539. Glu542, Met543 and Leu544, interacts 
with at least one first chemical moiety of the coactivator mimic. The method involves selecting 
at least one chemical modification of the first chemical moiety to produce a second chemical 
moiety with a structure that either decreases or increases an interaction between the interacting 

20 amino acid and the second chemical moiety as compared to the interaction between the 
interacting amino acid and the first chemical moiety. 

Use of an agonist in combination with a coactivator or coactivator mimic also provides a 
unique strategy for delivering therapeutics that have novel tissue-specific effects. For example, 
coactivator mimics can be designed to bind into the site involved in transcriptional activity only 

25 when helix- 12 is in its agonist bound state. If such coactivator mimics are specific for this site of 
a particular receptor, it is possible to selectively inhibit that receptor only in the presence of 
agonist. This could lead to novel, tissue specific antagonism based on the levels of endogenous 
agonists. Agonists designed by the methods of the instant invention could be used in assay to 
determine the specificity of coactivator mimics. Alternatively, the effective levels in a given 

30 tissue could be modulated by giving known antagonists or antagonists designed by the methods 
of the instant invention. The crystal structure of the LBD/DES/GRIP1 peptide complex, 
described herein, precisely defines the binding site that would need to be targeted. 

25 
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As noted above and as exemplified in the Examples, ER LBDs are co-crystallized with a 
peptide molecule comprising a coactivator GRIP1 NR Box II peptide sequence (SEQ ID NO:4) 
bound to the coactivator binding site and DES with the cocrystai structure refined to a resolution 
of 2.03 A and co-crystallized with OHT with the cocrystai structure refined to a resolution of 
1 .9A. Accordingly, the invention also provides for cocrystals made from nuclear receptor ligand 
binding domains with a ligand bound to the ligand binding domain and a molecule bound to the 
coactivator binding site. Preferably the cocrystai structure is refined to a resolution greater than 
3.6A, i.e.. having a resolution value less than 3.6A. More preferably the cocrystai structure is 
refined to greater than 3.4A. 3.2A, 3.0A. 2.8A, 2.6A, 2.4A, 2.2A, even more preferably to a 
resolution greater than 2.03A. The invention further provides for cocrystals made from nuclear 
receptor ligand binding domains with a ligand bound to the ligand binding domain. Preferably 
the cocrystai structure is refined to a resolution greater than 3.6A. i.e., having a resolution value 
less than 3. 6 A. More preferably the cocrystai structure is refined to greater than 3.4A, 3.2A, 
3.0A, 2.8A, 2.6A, 2.4A, 2.2A, 2.0A, even more preferably to a resolution greater than 1 .9A. 

Crystals are made from purified nuclear receptor LBDs that are usually expressed by a 
cell culture, such as E. coli. E. coli is often a preferred expression system. The thyroid receptor 
was successfully expressed in E. coli in Apriletti, et ah, supra . However, it has long been 
believed that a human heat shock protein was required for successful recombinant expression of 
the estrogen receptor. Therefore, it was quite unexpected to find that the estrogen receptor could 
be expressed as an active protein in E. coli. 

Preferably, different crystals (cocrystals) for the same nuclear receptor are separately 
made using different coactivators-type molecules, such as protein fragments, fusions or small 
peptides. The coactivator-type molecules preferably contain NR-box sequences necessary for 
binding to the coactivator binding site, or derivatives of NR-box sequences. Other molecules 
can be used in co-crystallization, such as small organics that bind to the coactivator or hormone 
binding site(s). Heavy atom substitutions can be included in the LBD and/or a co-crystallizing 
molecule. 

After the three dimensional structure of the cocrystai is determined, the structural 
information can be used in computational methods to design synthetic compounds for the nuclear 
receptor, and further structure-activity relationships can be determined through routine testing 
using the assays described herein and known in the art. 
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Since nuclear receptor LBDs may crystallize in more than one crystal form the structure 
coordinates of such receptors or portions thereof, as provided in Appendices 1 and 2. are 
particularly useful to solve the structure of those other crystal forms of nuclear receptors. They 
may also be used to solve the structure of mutants or co-complexes of nuclear receptors having 
5 sufficient homology. 

One method that may be employed for this purpose is molecular replacement. In this 
method, the unknown crystal structure, may be determined using the structure coordinates of this 
invention as provided in Appendices 1 and 2. The Appendix 1 coordinates for the DES-ERa 
LBD-GRIP1 NR Box II peptide complex and for the Appendix 2 coordinates for the OHT-ERa 
10 LBD complex have been deposited with the Brookhaven National Laboratory Protein Data 

Bank, and have been assigned Brookhaven Protein Data Bank Accession Numbers 2erd and 2ert, 
respectively. This method will provide an accurate structural form for the unknown crystal more 
quickly and efficiently than attempting to determine such information ab initio. 

Atomic coordinate information gleaned from the crystals of the invention can be stored. 
15 In a preferred embodiment, the information is provided in the form of a machine-readable data 
storage medium. This medium contains information for constructing and/or manipulating an 
atomic model of a ligand binding domain or portion thereof. For example, the machine readable 
data for the ligand binding domain comprises structure coordinates of amino acids corresponding 
to hERa Met343. Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394. Phe404, Mct42K 
20 Leu428, GIy521, His524, Leu525 and Met528, preferably Met343, Met42K His524. Leu525 and 
Met528, or a homologue of the molecule or molecular complex comprising the site. The 
homologues comprise a LBD that has a root mean square deviation from the backbone atoms of 
the amino acids of not more than 1.5A. A preferred molecule or complex represents a compound 
bound to the LBD. 

25 The machine-readable data storage medium can be used for interactive drug design and 

molecular replacement studies. For example, a data storage material is encoded with a first set 
of machine-readable data that can be combined with a second set of machine-readable data. For 
molecular replacement, the first set of data can comprise a Fourier transform of at least a portion 
of the structural coordinates of the nuclear receptor or portion thereof of interest, and the second 

30 data set comprises an X-ray diffraction pattern of the molecule or molecular complex of interest. 
Using a machine programmed with instructions for using the first and second data sets a portion 
or all of the structure coordinates corresponding to the second data can be determined. 
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Protein for crystals and assays described herein can be produced using expression and 
purification techniques described herein and known in the art. For example, high level 
expression of nuclear receptor LBDs can be obtained in suitable expression hosts such as E. coli. 
Expression of LBDs in E. coli. for example, includes the ERa LBD and other nuclear receptors. 
5 including GR and PR. Yeast and other eukaryotic expression systems can be used with nuclear 
receptors that bind heat shock proteins as these nuclear receptors are generally more difficult to 
express in bacteria, with the exception of ER, which can be expressed in bacteria. 
Representative nuclear receptors or their ligand binding domains have been cloned and 
sequenced: human ER (as described in Seielstad, et al., Molecular Endocrinology 9(6):647-658 

10 (1995), incorporated herein by reference), human GR, and human PR. The LBD for each of 
these receptors has been identified. 

Coactivator proteins can be expressed using techniques known in the an, particularly 
members of the pi 60 family of coactivator proteins that have been cloned and/or expressed 
previously, such as SRC-1, AIB1, RAC3, p/CIP, and GRIP1 and its homologues TIF 2 and 

15 NcoA-2. A preferred method for expression of coactivator protein is to express a fragment that 
retains transcriptional activation activity using the "Song and Fields" method (also referred to as 
the "yeast 2-hybrid" method) as described in publications by Hong, et al., Mol . Cell , Biol . 
17:2735-44 (1997) and Proc. Natl . Acad . Sci. USA 93(10):4948-52 (1996)), for GRIP1 
expression, which references are incorporated herein by reference. 

20 The proteins can be expressed alone, as fragments of the mature or full-length sequence, 

or as fusions to heterologous sequences. For example. ERa can be expressed without any 
portion of the DBD or amino-terminal domain. Portions of the DBD or amino-terminus can be 
included if further structural information with amino acids adjacent the LBD is desired. 
Generally, for the ERa the LBD used for crystals will be less than 320 amino acids in length. 

25 Preferably, the ERa LBD will be at least 220 amino acids in length and most preferably at least 
250 amino acids in length. For example the LBD used for crystallization can comprise amino 
acids spanning from 297 to 554 of the ERa. 

Typically the LBDs are purified to homogeneity for crystallization. Purity of LBDs can 
be measured with sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). 

30 mass spectrometry (MS) and hydrophobic high performance liquid chromatography (HPLC). 
The purified LBD for crystallization should be at least 97.5 % pure, preferably at least 99.0% 
pure, and more preferably at least 99.5% pure. 



WO 99/50658 



POYUS99/06937 



Initially, purification of the unliganded receptor can be obtained by conventional 
techniques, such as hydrophobic interaction chromatography (HPLC). ion exchange 
chromatography (HPLC), and heparin affinity chromatography. To achieve higher purification 
for improved crystals of nuclear receptors, especially the estrogen receptor, the receptors can be 
5 ligand-shift-purified using a column that separates the receptor according to charge, such as an 
ion exchange or hydrophobic interaction column, and then bind the eluted receptor with a ligand. 
especially an agonist. The ligand induces a change in the receptor's surface charge such that the 
liganded receptor elutes at a different position than the unliganded receptor. Usually saturating 
concentrations of ligand are used in the column and the protein can be preincubated with the 
10 ligand prior to passing it over the column. The structural studies detailed herein indicate the 
general applicability of this technique for obtaining super-pure nuclear receptor LBDs for 
crystallization. 

Purification can also be accomplished by use of a purification handle or "tag," such as 
with a histidine amino acid engineered to reside on the end of the protein, such as on the N- 

15 terminus, and then using a nickel or cobalt chelation column for purification. (Janknecht, Proc. 
Natl . Acad . Sci . USA . 88:8972-8976 (1991)) incorporated by reference. Typically purified 
LBD, such as ERa LBD, is equilibrated at a saturating concentration of ligand at a temperature 
that preserves the integrity of the protein. Ligand equilibration can be established between 2 and 
37°C, although the receptor tends to be more stable in the 2-20°C range. Preferably crystals are 

20 made with the hanging drop methods detailed herein. Regulated temperature control is desirable 
to improve crystal stability and quality. Temperatures between 4 and 25°C are generally used 
and it is often preferable to test crystallization over a range of temperatures. The crystals are 
then subjected to vapor diffusion and bombarded with x-rays to obtain x-ray diffraction pattern 
following standard procedures. 

25 For co-crystallization with a ligand that binds the ligand binding domain, alone or in 

conjunction with a peptide that binds to the coactivator binding site, various concentrations of 
ligands and peptides containing a sequence that binds to a coactivator binding site of a nuclear 
receptor of interest can be used in microcrystallization trials, and the appropriate compounds 
selected for further crystallization. Ligands and peptides can be assayed for binding to the ligand 

30 binding domain and coactivator binding sites of a nuclear receptor of interest by any number of 
techniques, including those assays described herein. For crystallization trials with the ERa 
LBD. the hanging drop vapor diffusion method is preferred. Conditions of pH. solvent and 
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solute components and concentrations and temperature can be adjusted, for instance, as 
described in the Examples. In the handing drop method, to obtain suitable crystals for x-ray 
diffraction analysis, seeding of prepared drops with microcrystals of the complex can be used. 
Collection of structural information can be determined by molecular replacement using the 
5 structure of the ERa LBD determined herein. The structure is refined following standard 
techniques known in the art. 

There are many uses and advantages provided by the present invention. For example, the 
methods and compositions described herein are useful for identifying peptides, peptidomimetics 
or small natural or synthetic organic molecules that modulate nuclear receptor activity. The 
10 compounds are useful in treating nuclear receptor-based disorders. Methods and compositions of 
the invention also find use in characterizing structure/function relationships of natural and 
synthetic ligands. 

The following discussion of provides an understanding of the basis for the examples, 
along with the results obtained and/or the conclusions reached. 

15 As described above, many coactivators recognize agonist bound nuclear receptor LBDs 

through the sequence LXXLL (SEQ ID NO:l), where L is leucine and X is any amino acid (the 
NR box). The structure of the DES-hERcc LBD-GRIP1 peptide complex reveals that the 
LXXLL motif (SEQ ID NO:l) forms the core of a short amphipathic a helix which is recognized 
by a highly complementary hydrophobic groove on the surface of the receptor. In agreement 

20 with the conclusions of other mutational and structural studies (Brzozowski, et al., supra and 
Feng, et al., supra), it is believed that this peptide binding groove formed by residues from 
helices 3, 4, 5 and 12 and the turn between helices 3 and 4 is the surface of ERa involved with 
transcriptional activity, i.e., the coactivator binding site. Further, structural studies of the 
complex between TRp and the GRIP1 NR box II peptide and biochemical studies of GRIP 1 

25 binding to TRP and GR (Darimont, et al., supra ) and study of the general features of the 

PPARy/SRC-1 peptide complex (Nolte, et al., supra ) are similar to those of the ERct/GRIPl NR 
box II peptide complex describied herein, suggesting that the mechanisms of NR box recognition 
are conserved across the nuclear receptor family. 

Of the eleven AF-2 residues whose side chains interact with the coactivator helix (Figure 

30 3A), only four (Lys 362. Leu 379. Gin 375 and Glu 542) are highly conserved across the nuclear 
receptor family (Wurtz. et al.. supra ). The side chains of Gin 375 and Leu 379 are 
predominantly buried even in the absence of GRIP1 binding and appear to form integral pans of 

30 
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the architecture of the surface involved in transcriptional activity. In contrast, the side chains of 
Lys 362 and GIu 542 are largely solvent exposed in the absence of coactivator and make both 
nonpolar contacts and the only receptor-mediated polar interactions with the coactivator helix. 
These two capping interaction residues arc perfectly positioned at opposite ends of the 
5 coactivator binding site groove not only to stabilize the main chain conformation of the 

coactivator but also to function as a molecular caliper; the 15 A distance between Lys 362 and 
Glu 542 is well suited to measure off the -1 1 A axial length of the short, two-turn coactivator a 
helix (Figure 3C). Similar receptor-mediated capping interactions have also been observed in a 
complex between the TRp LBD and the NR box II peptide (Darimont, et al., supra ). Mutation of 
1 0 either of these two residues severely cripples coactivator binding by ERa as well as by TR(i (see 
the Examples, Apriletti, et aL supra and Feng, et al., supra and Henttu, et al.. supra ). Hence, the 
formation of helix capping interactions may be a general feature of coactivator recognition by 
nuclear receptors. 

The side chains of six hydrophobic AF-2 residues (He 358, Leu 372, Val 376, Leu 379, 
1 5 Leu 539 and Met 543) form a large part of the highly cooperative network of van der Waals 

contacts made by the receptor with the hydrophobic face of the coactivator hejix (Figures 3 A and 
3C). Mutations in He 358, Val 376 and Leu 539 abrogate GRIP1 binding (see the Examples and 
Feng, et al., supra ). Although these residues are, in general, more poorly conserved across 
nuclear receptors than either Lys 362 or Glu 542, their hydrophobic character, with the exception 
20 of Leu 372, is conserved. Since the different NR LBDs adopt the same overall fold (Moras, et 
al.. Curr. Qpin . Cell Biol . 10:384-91 (1998)), it follows that the hydrophobic regions of different 
nuclear receptor coactivator binding site surfaces are distinctly textured. In support of this 
hypothesis, the NR box II peptide used in crystallization inhibited binding of GRIP1 to the LBDs 
of the ERa, the TRp and the glucocorticoid receptor (GR) with very different efficiencies (Ding, 
25 et al., supra ). 

The hydrophobic face of the NR box helix is formed by the side chains of the three motif 
leucines and the isoleucine preceding the motif (He 689). The functional importance of the 
conserved leucines in receptor binding has been demonstrated by numerous in vitro and in vivo 
studies. In contrast, the role of the residue preceding the motif in receptor binding has been 
30 poorly characterized. Both biochemical and structural data implicate He 689 as a key receptor 
binding determinant. In the crystal, only the side chains of the motif leucines and He 689 
extensively contact the LBD in both noncrystallographic symmetry related peptides. Mutation of 
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He 689 to alanine reduces the ability of the peptide to inhibit the binding of GRIP1 to ERct by 
-30 fold in a competition assay (data not shown). The side chain of He 689 lies in a rather 
chemically distinct environment: this residue forms van der Waals contacts with the aliphatic 
portion of the Asp 538 side chain, the side chain of Leu 539 and the y-carboxylate of Glu 542 
5 (Figures 1 A and 3A). The proximity of this negatively charged moiety of Glu 542 to the 

hydrophobic side chain of He 689 should enhance the electrostatic potential of the side chain 
carboxylate and strengthen its stabilizing interactions with the N-terminus of the coactivator 
helix. Despite its apparently crucial role in receptor recognition, the identity of the residue 
immediately preceding known NR boxes is poorly conserved. This sequence variability should 

1 0 have effects not only on packing interactions with ERct but also on both the chemical 

environment and the critically important orientation of Glu 542. This should in turn translate 
into variations in affinity for the receptor. Indeed, the three NR boxes from GRIP1 . which each 
contain a different residue preceding the LXXLL motif (SEQ ID NO:l), have differing affinities 
for ERa (Ding, et al., supra ; Voegel, et al., EMBO J. 17:507-19 (1998)). 

1 5 The NR boxes of TIF2, the human homologue of GRIP1 , have been found to be partially 

functionally redundant despite their individual differences in receptor binding affinities. All 
three of NR boxes of TIF2 must be mutated to completely eliminate interaction with the ERa 
(Voegel, et al., EMBO J. 17:507-19 (1998)). Our data indicate that a single NR box peptide is 
sufficient to form a tight complex with a single ERa LBD. Yet pi 60 coactivators possess 

20 multiple NR boxes. A possible explanation for the presence of multiple NR boxes is that they 

provide coactivators with broad specificity. Receptor binding relies upon the intricate formation 
of multiple van der Waals interactions yet the various nuclear receptors appear to have different 
coactivator binding site surfaces. The different amino acids in the position immediately 
preceding the LXXLL motif (SEQ ID NO:l) might allow some degree of adaptability to these 

25 distinct surfaces; however, there may be no NR box sequence that is capable of efficiently 
binding to all nuclear receptors. Multiple NR boxes may therefore provide coactivators the 
diversity of interfaces necessary to recognize a variety of targets. 

ERa transcriptional activity is blocked by antagonists such as OHT and RAL. The most 
striking feature of the structures of the OHT and RAL liganded ERa LBDs is that helix 12 is 

30 bound to the static region of the coactivator recognition groove (Figure 3B and (Brzozowski. et 
al., supra ). A comparison of these two structures with the structure of the coactivator/LBD 
complex reveals that in the antagonist complexes, the region of helix 12 with an NR box-like 
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sequence (LXXML versus LXXLL) (SEQ ID NO:2 versus SEQ ID NO: 1) functions as an 
intramolecular mimic of the coactivator helix (Figure 5 and Brzozowski. et al.. supra ). 
Consistent with the proposals of others (Brzozowski, et al.. supra and Darimont. et al., supra ), 
this disposition of helix 12 directly affects the structure and function of the surface responsible 
5 for transcriptional activity in two ways. First, because helix 12 residues form an integral part of 
the coactivator binding site surface, the surface is incomplete when helix 12 is in the antagonist- 
bound conformation. In particular. Leu 539, Glu 542 and Met 543 are incorrectly oriented for 
coactivator recognition. Second, residues from the static region of this surface are bound to helix 
12 and are prevented from interacting with coactivator (Figures 3A and 3B). 

10 The sequence similarity of helix 12 of the ERa LBD to the LXXLL motif (SEQ ID 

NO:l) appears to be unique among nuclear receptors. The identities of the residues in this region 
of helix 12 in other nuclear receptors, although generally hydrophobic in character, do not as 
closely resemble the sequence of an NR box as those of ERa (Wurtz, et al., supra ). However, it 
is possible that an intramolecular inhibitor with a suboptimal recognition sequence would 

15 compete for coactivator binding given its extremely high local concentration. 

The binding of OHT to the ERa promotes a helix 12 conformation that inhibits binding 
of the coactivator. OHT does not directly interact with any helix 12 residues (Figure 4B). 
Moreover, the structure of the LBD in the region of the coactivator binding site groove that 
interacts with helix 12 in the OHT complex is the same in the DES and E2 complexes (Figures 

20 3A, 3B and 5). 

Numerous studies have demonstrated the importance of the OHT side chain in receptor 
antagonism (Jordan, et al., supra and Robertson, et al., J. Med . Chem . 25:167-71 (1982)). A 
comparison of the structures of the OHT and DES complexes reveals that the binding mode of 
the OHT side chain precludes the agonist-induced conformation of helix 12. The OHT side 

25 chain projects out of the ligand binding pocket between helices 3 and 1 1 (Figures 2B, 6B and 
6C). As a result, the positioning of helix 12 over the ligand binding pocket, as it is in the 
agonist-bound conformation, would bury the positively charged dimethylamino group of the 
OHT side chain within a hydrophobic cavity and produce steric clashes between the 
dimethylaminoethyl region of side chain and the side chain of Leu 540. 

30 In functional terms. OHT is not, however, simply "an agonist with a side chain". OHT 

binding promotes a conformation of the LBD that is distinct from that stabilized by either DES 



BNSDOCID: <WO 



9950658A2 I > 



WO 99/50658 



PCT7US99/06937 



or E2 binding. These different conformations impose different restrictions on the positioning of 
helix 12. 

Helices 3, 8 and 1 1 in the DES and E? complexes are between one to two turns longer 
than they are in the OHT complex (Figure 6A and (Brzozowski, et al.. supra ). Helix 1 1 ends at 
5 Cys 530 in the DES and E2 complexes and it ends at Tyr 526 in the OHT complex. Helix 12 

begins at Leu 536 in the OHT complex. This appears to be necessary; in the antagonist complex. 
Leu 536 forms a cooperative network of nonpolar contacts and hydrogen bonds with Glu 380 
and Tyr 537 that stabilizes the N-terminus of helix 12 (Figure IB). Therefore, if helix 12 were to 
bind the static region of the coactivator binding site in the presence of agonist, the loop 

10 connecting helices 1 1 and 12 would be required to span ~17A over five residues. Although 
theoretically possible, this conformation would be highly strained and hence unlikely. In 
contrast, the longer loop connecting helices 1 1 and 12 in the OHT complex allows helix 12 to 
extend to the static region of the coactivator binding groove. 

In the DES and E 2 complexes, helix 12 and the loop connecting helices 1 1 and 12 pack 

15 against helices 3 and II, whereas they do not in the OHT complex (Figures 2A and 2B and 

(Brzozowski, et al., supra ). A recently described structure of the E 2 -LBD complex suggests that 
the longer helices in the DES and E 2 complexes are not dependent upon the interactions helix 12 
forms in the agonist-bound conformation (Tanenbaum, et al., supra ). In this structure, a crystal 
packing artifact forces helix 12 to contact a symmetry-related molecule. Helix 12 is clearly not 

20 positioned over the ligand binding pocket in this structure. Nevertheless, helices 3, 8 and 1 1 are 
longer than they are in the OHT complex (Figure 6A). Hence the longer helices of the agonist 
complexes occur independently of the positioning of helix 12 over the ligand binding pocket and 
are instead a direct result of agonist binding. 

The secondary structure differences between the agonist complexes and the OHT 

25 complex arise from distinct arrangements of packing interactions induced by the different 
ligands. A cooperative network of van der Waals contacts, organized around DES or E2, 
between various hydrophobic residues from helices 3, 7, 8 and 1 1 and the P hairpin appears to 
stabilize the longer helices in the agonist complexes (Figure 4A and 6D). The placement of the 
OHT B ring forces many of ligand binding pocket residues that surround it to adopt 

30 conformations that are dramatically different from those they adopt in either the DES or E2 

structures. As a result, many of the interresidue packing interactions present in the DES and E 2 
structures are either absent or altered in the OHT structure (Figure 6D). These structural 
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distortions apparently force the main chain from residues 339 to 341. 421 to 423. and 527 to 530 
(which form parts of helices 3, 8 and 1 1 respectively in the agonist structures ) to adopt an 
extended conformation in the OHT structure (Figures 6A-D). 

Therefore the binding of OHT has two distinct effects on the positioning of helix 12. each 
5 of which contributes to antagonism (Figure 7). Helix 12 is prevented from being positioned over 
the ligand binding pocket by the OHT side chain. In addition, the alternative packing 
arrangement of ligand binding pocket residues around OHT stabilizes a conformation of the 
LBD that permits helix 12 to reach the static region of the coactivator binding site and mimic 
bound coactivator. 

10 These mechanisms do not appear to be specific to OHT. The side chain of RAL, like that 

of OHT, sterically hinders the agonist-bound conformation of helix 12 (Brzozowski. et al.. 
supra ). In addition, helix 1 1 appears to end at Met 528 in the RAL complex. This may result 
from the distortions in the binding pocket in the vicinity of His 524 directed by RAL binding 
(Brzozowski, et al., supra ). 

15 These results are supported by the experimental data provided in the examples below, 

which are also intended to illustrate various aspects of this invention; These examples do not 
limit the scope of this invention. 

EXAMPLES 
Example 1 

20 Experimental Procedures 

Protein Expression and Purification 

The human ERa-LBD 297-554 was overexpressed as described previously (Seielstad. et 

al., Mol . Endocrinol . 9:647-658 (1995)) in BL21(DE3)pLysS cells transformed with a modified 

pET-23d-ERG vector that contained the sequence Met-Asp-Pro fused to residues 297 through 
25 554 of the hERa (provided by Paul Sigler of Yale University). Clarified bacterial lysates were 

adjusted to 3 M in urea and 0.7 M in NaCl and then applied to a 10-ml column of estradiol- 

Sepharose (Greene, et al., Proc . Natl . Acad . Sci . USA 77:51 15-5119 (1980); Landel, et al., Mol . 

Endocrinol . 8: 1 407- 1419(1 994); Landel, et al.. J. Steroid Biochem . Molec . Biol . 63 :59-73 

(1997)).^ 

30 To carboxymethylate the solvent-accessible cysteines, the bound hERa-LBD was treated 

with 5 mM iodoacetic acid in 10 mM Tris, pH 8.1. 250 mM NaSCN (Hegy, et al.. Steroids 
61:367-373 (1996)). Protein was eluted with 3 x 10-5 M ligand (either DES or OHT) in 30-100 
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ml of 50 mM Tris, 1 mM EDTA. 1 mM DTT and 250 mM NaSCN. pH 8.5. The yield of hERa- 
LBD was typically close to 100% (Seielstad. et al.. Biochemistry 34:12605-12615 (1995)). The 
affinity-purified material was concentrated and exchanged into 20 mM Tris, 1 mM EDTA, 4 mM 
DTT. pH 8. 1 by ultrafiltration. The protein was bound to a Resource Q column (Pharmacia) and 
5 then eluted with a linear gradient of 25-350 mM NaCl in 20 mM Tris. pH 8.1. 1 mM DTT. The 
hERct-LBD-ligand complexes eluted at 1 50-200 mM NaCl. Pooled fractions were concentrated 
by ultrafiltration and analyzed by SDS-PAGE, native PAGE, and electrospray ionization mass 
spectrometry. 
GST-pulldown Assays 

1 0 A fusion between glutathione-S-transferase (GST) and amino acids 282-595 of hERct 

was constructed by subcloning the EcoRI fragment from pSG5 ERa-LBD (Lopez et al., 
submitted manuscript) into pGEX-3X (Pharmacia). The He 358-> Arg, Lys 362->Ala, and Leu 
539->Arg mutations were introduced into the GST-LBD construct using the QuikChange Kit 
(Stratagene) according to the manufacturer's instructions. The Val 376->Arg and Glu 542->Lys 

15 mutations were created in the GST-LBD construct by subcloning the Bsml/Hindlll fragments of 
derivatives of pSG5-ER-HEGO (Tora. et al., EMBO J. 8:1981-6 (1989)) into which these 
mutations had already been introduced. All constructs were verified by automated sequencing 
(University of Chicago Cancer Research Center DNA Sequencing Facility). 

The wild-type and mutant GST-LBDs were expressed in BL21(DE3) cells. Total ligand 

20 binding activity was determined by a controlled pore glass bead assay (Greene, et al., Mol . 

Endocrinol . 2:714-726 (1988)) and protein levels were monitored by western blotting with a 
monoclonal antibody to hERa (H222). Cleared extracts containing the GST-LBDs were 
incubated in buffer alone (50 mM Tris, pH 7.4, 150 mM NaCl, 2 mM EDTA, 1 mM DTT, 0.5% 
NP-40 and a protease inhibitor cocktail) or with 1 |iM of either DES or OHT for 1 hour at 4°C. 

25 Extract samples containing thirty pmol of GST-LBD were then incubated with 10 |il glutathione- 
Sepharose-4B beads (Pharmacia) for 1 hour at 4°C. Beads were washed five times with 20 mM 
HEPES, pH 7.4, 400 mM NaCl, and 0.05% NP-40. 35 S-labeled GRIP1 was synthesized by in 
vitro transcription and translation using the TNT Coupled Reticulocyte Lysate System (Promega) 
according to the manufacturers instructions and pSG5-GRJPl (provided by Michael Stallcup of 

30 the University of Southern California) as the template. Immobilized GST-LBDs were incubated 
for 2.5 hours with 2.5 \x\ aliquots of crude translation reaction mixture diluted in 300 |il of Tris- 
buffered saline (TBS). After five washes in TBS containing 0.05% NP-40. proteins were eluted 
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by boiling ihe beads for 10 minutes in sample buffer. Bound 3> S-GRJP1 was quantitated by 
fluorography following SDS-PAGE. 
Crystallization and Data Collection 

Crystals of the DES-hERa LBD-GRIP1 NR Box II peptide complex were obtained by 
5 hanging drop vapor diffusion. Prior to crystallization, the DES-hERa LBD (residues 297-554) 
complex was incubated with a 2-4 fold molar excess of the GRIP1 NR Box II peptide (SEQ ID 
NO:4) for 7-16 hr. Two |^L samples of this solution were mixed with equal volume samples of 
reservoir buffer consisting of 25-27% (w/v) PEG 4000. 90 mM Tris (pH 8.75-9.0) and 180 mM 
Na Acetate and suspended over wells containing 800 jj.L of the reservoir buffer. After 4-7 days 

10 at 19-21°C. rod-like crystals were obtained. The coactivator complex crystals lie in the 

spacegroup P2 ( with cell dimensions a=54.09, b=82.22. c=58.04 and p=l 1 1 .34. Two molecules 
each of the DES-LBD and the coactivator peptide form the asymmetric unit. A 200 |im x 40 jam 
x 40 |am crystal was transferred to a cryosolvent solution containing 25% (w/v) PEG 4000, 10% 
(w/v) ethylene glycol, 100 mM Tris (pH 8.5), 200 mM Na Acetate and 10 ^iM peptide and frozen 

15 in an N2 stream at - 1 70°C in a rayon loop. Diffraction data from this crystal were measured at - 
170°C using a 300 mm MAR image plate at the Stanford Synchrotron Radiation Laboratory 
(SSRL) at beamline 7-1 at a wavelength of 1 .08 A. 

Crystals of the hERa LBD complexed to OHT were obtained by the hanging drop vapor 
diffusion method. Equal volume aliquots (2 ^iL) of a solution containing 3.9 mg/mL protein- 

20 ligand complex and the reservoir solution containing 9% (w/v) PEG 8000, 6% (w/v) ethylene 
glycol, 50 mM HEPES (pH 6.7) and 200 mM NaCl were mixed and suspended over 800 ^iL of 
the reservoir solution. Hexagonal plate-like crystals formed after 4-7 days at 21-23°C. Both 
crystal size and quality were improved through microseeding techniques. These crystals belong 
to the space group P6>22 with cell parameters a=b=58.24A and c=277.47A. The asymmetric 

25 unit consists of a single LBD monomer; the dimer axis lies along a crystallographic two-fold. A 
single crystal (400 |im x 250 jam x 40 jam) was briefly incubated in a cryoprotectant solution 
consisting of 10% (w/v) PEG 8000, 25% (w/v) ethylene glycol, 50 mM HEPES (pH 7.0) and 200 
mM NaCl and then flash frozen in liquid N2 suspended in a rayon loop. Diffraction data were 
measured at -1 70°C using a 345 mm MAR image plate at SSRL at beamline 9-1 and at a 

30 wavelength of 0.98A. 
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The images of both data sets were processed with DENZO and scaled with 
SCALEPACK (Otwinowski. et al.. Methods Enzvmol . 276:307-326 ( 1 997)) using the default - 
3o cutoff. 

Structure Determination and Refinement 
5 Initial efforts to determine the structure of the DES-LBD-GRIP1 NR Box II peptide 

complex utilized a low resolution (3.1 A) data set (data not shown). A self-rotation search 
implemented with POLARRFN f The CCP4 suite: programs for protein crystallography", Acta 
Crvstallocr . D 50:760-763 (1994)) indicated the presence of a noncrystallographic dyad. The 
two LBDs in the asymmetric were located by molecular replacement in AMoRe (CCP4, 1994) 

10 using a partial polyalanine model of the human RARy LBD (Renaud. et al., supra ) as the search 
probe (R=58.2%, CC=35.6% after placement of both monomers). Given that the model at this 
point was both inaccurate (r.m.s.d. 1 .7 A between this model and the final model based on Cot 
positions) and incomplete (accounting for only -45% of the total scattering matter in the 
asymmetric unit), an aggressive density modification protocol was undertaken. Iterative cycles 

15 of two-fold NCS averaging in DM (CCP4, 1994) interspersed with model building in MOLOC 
(Muller, et al., Bull . Soc . Chim . Belg . 97:655-667 (1988)) and model refinement in REFMAC 
(Murshudov, et al., Acta Crvstallogr . D 53:240-255 (1997)) (using tight NCS restraints) were 
used to quickly build a model of the LBD alone. For this procedure, MAMA (KJeywegt, et al., 
"Halloween... masks and bones. In From First Map to Final Model", Bailey, et al, eds., 

20 Warrington, England, SERC Daresbury Laboratory, 1994) was used for all mask manipulations 
and PHASES (Furey, et al.. PA33 Am. Crvst . Assoc . Mtg . Abstr. 18:73 (1990)) and the CCP4 
suite (CCP4, 1994) were used for the generation of structure factors and the calculation of 
weights. 

However, although the DES-LBD-GRIP1 NR Box II peptide complex model accounted 
25 for -90% of the scattering matter in the asymmetric unit, refinement was being hampered by 
severe model bias. The OHT complex data set was then collected. Starting with one of the 
monomers of the preliminary DES-LBD model as the search probe, molecular replacement in 
AMoRe was used to search for the location of LBD in this crystal form in both P6i22 and P6522. 
A translation search in P6 5 22 yielded the correct solution (R=53.8%. CC=38.2%). In order to 
30 reduce model bias. DMMULTI (CCP4. 1994) was then used to project averaged density from the 
DES complex cell into the OHT complex cell. Using MOLOC. a model of the LBD was built 
into the resulting density. The model was refined initially in REFMAC and later with the 
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simulated annealing, positional and B-factor refinement protocols in X-PLOR (Brunger, X- 
PLOR Version 3.843, New Haven. Connecticut: Yale University, 1996) using a maximum- 
likelihood target (Adams, et al., Proc . Natl . Acad . Sd. USA 94:5018-23 (1997)). Anisotropic 
scaling and a bulk solvent correction were used and all B-factors were refined isotropically. 
Except for the R frce set (a random sampling consisting of 8% of the data set), all data between 41 
and 1.9A (with no a cutoff) were included. The final model consisted of residues 306-551, the 
ligand and 79 waters. According to PROCHECK (CCP4, 1994), 91.6% of all residues in the 
model were in the core regions of the Ramachandran plot and none were in the disallowed 
regions. 

The high resolution data set of the DES-LBD-GRIP1 NR Box II peptide complex became 
available when the R free of the OHT-LBD model was -31%. Both monomers in the asymmetric 
unit of the DES complex crystal were relocated using AMoRe and the incompletely refined 
OHT-LBD model (with helix 12 and the loop between helices 1 1 and 12 removed) as the search 
model. The missing parts of the model were built and the rest of the model was corrected using 
MOLOC and two-fold averaged maps generated in DM. Initially, refinement was carried out 
with REFMAC using tight NCS restraints. At later stages, the model was refined without NCS 
restraints using the simulated annealing, positional and B-factor refinement protocols in X- 
PLOR and a maximum-likelihood target. All B-factors were refined isotropically and 
anisotropic scaling and a bulk solvent correction were used. The R frce set contained a random 
sample of 6.5% of all data. In refinement, all data between 27 and 2.03 A (with no a cutoff) were 
used. The final model was composed of residues 305-549 of monomer A, residues 305-461 and 
470-554 of monomer B, residues 687-697 of peptide A, residues 686-696 of peptide B, two 
ligand molecules, 147 waters, two carboxymethyl groups and a chloride ion. According to 
PROCHECK, 93.7% of all residues in the model were in the core regions of the Ramachandran 
plot and none were in the disallowed regions. 

Figure I A provides a view of a 2Fo-Fc electron density map calculated at 2.03 A 
resolution and contoured at 1.0 cr showing the GRIP1 NR box II interaction with the LBD. The 
GRIP1 NR Box II peptide (SEQ ID NO:4) was omitted from the model prior to map calculation, 
lie 689 from the peptide and two of the three receptor residues with which it interacts (Glu 542 
and Leu 539) are labeled. Asp 538 has been omitted for clarity. The hydrogen bonds between 
the y-carboxylate of Glu 542 and the amides of residues 689 and 690 of the peptide are depicted 
as dashed orange bonds. Figure 1 B provides a view of a 2Fo-Fc electron density map calculated 
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at 1.90A resolution and contoured at 1.0 a showing the N-terminal region of helix 12. The 
dashed orange bonds depict the water-mediated hydrogen bond network between the imidazole 
ring of His 377. the y-carboxylate of Glu 380. and the amide of Tyr 537. The three labeled 
residues (Glu 380. Leu 536 and Tyr 537) interact with each other through van der Waals contacts 
and/or hydrogen bonds. Intriguingly, mutations in each these three residues dramatically 
increase the transcriptional activity of unliganded ERa LBD (Eng, et al., Mol . Cell . Biol . 
1 7:4644-4653 ( 1 997); Lazennec, et al., Mol. Endocrinol . 1 1 : 1 375-86 ( 1 997); White, et al., 
EMBO J. 16:1427-35(1997)). 

Example 2 
Structure Determination 

GRIP 1 , a mouse pi 60 coactivator, interacts both in vivo and in vitro with the ERa LBD 
bound to agonist (Ding, et aL supra ), but not with the LBD bound to antagonist (Norris, et al., J. 
Biol . Chem. 273:6679-88 (1998)). Mutational studies of GRIP 1 and its human homologue TIF2 
suggest that of the three NR boxes from GRIP1, NR box II (residues 690 to 694) binds most 
tightly to the ERa LBD (Ding, et al., supra and Voegel, et al., supra) . 

Competition assays indicate that a 13 residue GRIP1 NR Box II peptide, NH 2 - 
KHKILHRLLQDSS-CO2H (SEQ ID NO:4), corresponding to residues 686-698 of GRIP1 (Ding, 
et aL, supra ), synthesized by standard solid phase methods, binds specifically to the agonist- 
bound ERa LBD (IC 50 <0.4jiM; Kushner, unpublished) and to other agonist-bound NR LBDs 
(Ding, et al., supra and Darimont, at al, supra ). 

An electrophoretic mobility shift assay was used to demonstrate that the GRIP1 NR Box 
II peptide (SEQ ID NO:4) bound the ERa LBD in the presence of the agonist DES, but not the 
antagonist, OHT. Eight microgram samples of purified hERa-LBD bound to either DES or 
OHT were incubated in the absence of the GRIP1 NR Box II peptide (SEQ ID NO:4), i.e., buffer 
alone, or in the presence of either a 2-fold or 10-fold molar excess of the GRIP1 NR Box II 
peptide. The binding reactions were performed on ice for 45 minutes in 10 (al of buffer 
containing 20mM Tris, pH 8.1, ImM DTT, and 200mM NaCl and then subjected to 6% native 
PAGE. Gels were stained with GELCODE Blue Stain reagent (Pierce). 

In the presence of the NR box II peptide, the migration of the DES-LBD complex was 
retarded. In contrast, peptide addition had no effect on the mobility of the OHT-LBD complex. 
Hence, this peptide fragment of GRIP1 possesses the ligand-dependent receptor binding activity 
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characteristic of the full-length protein. These observations suggest that the GRIP1 NR Box II 
peptide is a valid model for studying the interaction between GRIP1 and the ERa LBD. 

In order to characterize structurally the interaction between the GRIP1 NR Box II peptide 
and the ERa LBD. recombinant human ERa LBD (residues 297-554) was crystallized bound to 
5 both DES and the GRIP1 NR Box II peptide. The ERa LBD bound to OHT was also 
crystallized in order to determine the mechanism by which this antagonist blocks 
coactivator/ERa interaction. X-ray diffraction data from these crystals were measured and the 
structures were determined by a combination of molecular replacement (using a modified 
version of the coordinates of the human retinoic acid receptor y (RARy) LBD. Renaud, et al„ 

10 supra , as the search model) and aggressive density modification. 

The structure of the DES-ERa LBD-GRIP1 NR Box II peptide complex has been refined 
to a crystallographic R-factor of 19.9% (R frcc =25.0%) using data to 2. 03 A resolution, as shown 
in Figure 1 A and Table 2. The structure of the OHT-ERa LBD complex has been refined using 
data to 1.90A to a crystallographic R-factor of 23.0% (R frcc =26.2%), as shown in Figure IB and 

15 Table 2. 

Table 2 

Summary of Crystallographic Statistics 

Ligand [ 



Data Collection DES OHT 

20 Space group P2| P6 5 22 

Resolution 2.03 1.90 

Observations 1 04 1 89 269253 

Unique 30265 23064 

Completeness (%) 98.4 99.1 

25 R S ym(%) a ~ 7.8 7.0 

Average I/al 9.8 16.1 
Refinement 

Number of non-hydrogen atoms 4 1 80 2070 

Rcyst (%) b /Rfrcc (%) 1 9.9/25.0 23.0/26. 1 

30 Bond r.m.s. deviation (A) 0.006 0.006 

Angle r.m.s. deviation (°) 1 .05 1 .05 

Average B factor (A 2 ) 34.0 40.4 



Rsym =Ii I Ii- <Ii> \t Sili where <h> is the average 
intensity over symmetry equivalents 
35 b Roya-ZlFo-Fcl/ZlFol 
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Example 3 

Overall Structure of the DES-LBD-GRIPl NR Box II Peptide Complex 
The asymmetric unit of the DES-LBD-GRIPl NR Box II peptide complex crystals 
contains the same noncrystallographic dimer of LBDs that has been observed in the previously 
5 determined structures of the LBD bound to both Et and RAL (Brzozowski. et aL supra and 

Tanenbaum. et aL supra ). Beyond the flexible loops between helices 2 and 3 and helices 9 and 
10, the two LBDs of the dimer adopt similar structures (r.m.s.d. 0.47A based on Ca positions). 
The conformation of each LBD complexed with DES closely resembles that of the LBD bound 
to E? (Brzozowski. et aL supra ); each monomer is a wedge shaped molecule consisting of three 

10 layers of eleven to twelve helices and a single beta hairpin (Figure 2 A). In each LBD, the 

hydrophobic face of helix 12 is packed against helices 3. 5/6 and 1 1 covering the ligand binding 
pocket (Figure 2A). One GRIP1 NR Box II peptide is bound to each LBD in a hydrophobic cleft 
composed of residues from helices 3, 4, 5 and 12 and the turn between 3 and 4 (Figures 2A and 
3A). The density for both peptides in the asymmetric unit is continuous and unambiguous 

15 (Figure 1 A). Residues 687 to 697 from the GRIP1 NR Box II peptide, designated peptide A. and 
residues 686 to 696 from the GRIP1 NR Box II peptide, designated peptide B, have been 
modeled; the remaining residues are disordered. Given that each peptide lies within a different 
environment within the crystal, it is striking that from residues lie 689 to Gin 695 each peptide 
forms a two turn, amphipathic a helix (Figures 2A and 3 A). Flanking this region of common 

20 secondary structure, the peptides adopt dissimilar random coil conformations. 

The overall structures of the DES-ERot LBD-GR1P1 NR Box II peptide complex and the 
OHT-ERct LBD complex are illustrated in Figure 2. In Figure 2A, the coactivator peptide and 
the LBD are shown as ribbon drawings. The peptide is colored gold and helix 12 (residues 538- 
546) is colored magenta. Helices 3, 4 and 5 (labeled H3. H4 and H5 respectively) are colored 

25 blue. DES, colored green, is shown in space-Filling representation. In Figure 2B, the LBD is 
depicted as a ribbon drawing. As in Figure 2A. helix 12 (residues 536-544) is colored in 
magenta and helices 3. 4 and 5 are colored blue. OHT, in red. is shown in space-filling 
representation. 

Example 4 

30 The NR Box II Peptide-LBD Interface 

The binding of the GRIP1 NR Box II peptide to the ERa LBD buries 1000A : of 
predominantly hydrophobic surface area from both molecules. The GRIP1 NR Box II peptide 
binding site is a shallow groove composed of residues Leu 354. Val 355. He 358. Ala 361 and 
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Lys 362 from helix 3; Phe 367 and Val 368 from helix 4; Leu 372 from the turn between helices 
3 and 4; Gin 375, Val 376. Leu 379 and Glu 380 from helix 5: and Asp 538. Leu 539. Glu 542 
and Met 543 from helix 12 (Figure 3 A). The floor and sides of this groove are completely 
nonpolar, but the ends of this groove are charged (Figure 3C). 
5 The LBD interacts primarily with the hydrophobic face of the GRJP1 NR Box II peptide 

a helix formed by the side chains of He 689 and the three LXXLL motif (SEQ ID NO:l) 
leucines (Leu 690, Leu 693 and Leu 694). The side chain of Leu 690 is deeply embedded within 
the groove and forms van der Waals contacts with the side chains of He 358, Val 376, Leu 379. 
Glu 380 and Met 543 (Figure 3 A and 3C). The side chain of Leu 694 is similarly isolated within 

10 the groove and makes van dcr Waals contacts with the side chains of He 358. Lys 362, Leu 372, 
Gin 375, Val 376 and Leu 379 (Figure 3 A and 3C). In contrast, the side chains of both He 689 
and the second NR box leucine. Leu 693. rest against the rim of the groove (Figure 3 A and 3C). 
The side chain of He 689 lies in a shallow depression formed by the side chains of Asp 538, Leu 
539 and Glu 542. The side chain of Leu 693 makes nonpolar contacts with the side chains of He 

15 358 and Leu 539. 

The charged and polar side chains which form the hydrophilic face of the peptide helix 
project away from the receptor and either interact predominantly with solvent or form symmetry 
contacts (Figure 1 A). None of the side chains of the polar and charged residues outside the 
helical region of either peptide in the asymmetric unit, with the exception of Lys 688 of peptide 

20 B, is involved in hydrogen bonds or salt bridges with its associated LBD monomer. The e-amino 
group of Lys 688 of peptide B hydrogen bonds to the side chain carboxylate of Glu 380 of 
monomer B. This interaction is presumably a crystal artifact; the main chain atoms of the N- 
terminal three residues of peptide B are displaced from monomer B and interact extensively with 
a symmetry-related LBD. 

25 In addition to interacting with the hydrophobic face of the peptide helix, the LBD 

stabilizes the main chain conformation of the NR box peptide by forming capping interactions 
with both ends of the peptide helix. Glu 542 and Lys 362 are positioned at opposite ends of the 
peptide binding site (Figure 3 A). The side chains of Glu 542 and Lys 362 form van der Waals 
contacts with main chain and side chain atoms at the N- and C-terminal turns of the peptide helix 

30 respectively . These interactions position the stabilizing charges of the y-carboxylate of Glu 542 
and e-amino group of Lys 362 near the ends of the GRIP1 NR Box II peptide helix (Figure 3C). 
The side chain carboxylate (y-carboxylate) of Glu 542 hydrogen bonds to the amides of the 
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residues of N-terminal turn of the peptide helix (residues 688 and 689 of peptide A; residues 689 
and 690 of peptide B) (Figure 1 A). Similarly, the e-amino group of Lys 362 hydrogen bonds to 
the carbonyls of the residues of the C-tcrminal turn of the peptide helix (residue 693 of peptide 
A; residues 693 and 694 of peptide B) (Figure 5). 

To test the importance of the GRIP1 NR Box II pcptide/LBD interface observed in the 
crystal, a series of site-directed mutations were introduced into the ERa LBD. These mutations 
were designed either to simultaneously perturb the structural integrity and the nonpolar character 
of the floor of the binding groove (He 358->Arg, Val 376->Arg and Leu 539->Arg) or to prevent 
the formation of the capping interactions (Lys 362->Ala and Glu 542->Lys). Fusions of 
glutathione-S-transferase (GST) to the wild-type and mutant LBDs were analyzed for their 
ability to bind 35 S-labeled GRIP1 in the absence of ligand or in the presence of DES or OHT. 

35 S-labeled GRIP1 was incubated with either immobilized GST, immobilized wild type 
GST-hERa LBD, or immobilized mutant GST-LBDs in the absence of ligand or in the presence 
of DES or OHT. The bound GRIP1 was quantitatcd after SDS-PAGE. I358R, mutant LBD 
containing a Ile->Arg substitution at residue 358; K362A, mutant LBD containing a Lys->Ala 
substitution at residue 362; V376R, mutant LBD containing a Val->Arg substitution at residue 
376; L539R, mutant LBD containing a Leu->Arg substitution at residue 539; E542K, mutant 
LBD containing a Glu->Lys substitution at residue 542. 

In the absence of ligand or in the presence of OHT, fusions to the wild-type protein and 
all of the mutant LBDs showed no detectable binding to GRIP1. The He 358->Arg, Val 376- 
>Arg and Leu 539->Arg mutants were all unable to interact with coactivator in the presence of 
agonist, confirming the importance of the packing interactions observed in the crystal. 
Disruption of either the N- or C-terminal capping interaction also compromised GRIP1 binding 
in the presence of agonist. Only the wild-type GST-LBD was able to recognize the coactivator 
in the presence of DES. 

Example 5 
Agonist Recognition 

Despite its different shape and larger molecular volume, DES (273 A 3 ) is accommodated 
within the same binding pocket that recognizes E2 (252A 3 ). In its receptor complex. DES is 
completely encased within the narrower half of the LBD in a predominantly hydrophobic cavity 
composed of residues from helices 3, 6, 7, 8, 1 1 , and 1 2 as well as the S 1/S2 hairpin (Figures 2A 
and4A). 
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The interaction of DES with ERct resembles that of E : . One of the phenolic rings of DES 
lies in the same position as the E? A ring near helices 3 and 6. Like the aromatic ring of the Ei, 
the DES A ring (Figure 4A) is engaged by the side chains of Phe 404, Ala 350, Leu 387 and Leu 
391 with its phenolic hydroxyl forming hydrogen bonds to the y-carboxylate of GIu 353. to the 
5 guanidinium group of Arg 394, and to a structurally conserved water molecule. The A' ring of 
DES (Figure 4A) is bound near helices 7, 8 and 1 1 adjacent to the location of the E 2 C and D 
rings. This ring forms van der Waals contacts not only with Gly 521 and Leu 525, like the D 
ring of E 2 , but also with Met 343, Leu 346 and Met 421 (Figure 4A). Even though it is located 
1 .7A from the position of the D ring hydroxyl, the DES A' ring phenolic hydroxyl is still able to 

10 hydrogen bond to the imidazole ring of His 524 (Figure 4A). 

DES also forms contacts with the LBD that Ei does not. There are unoccupied cavities 
adjacent to the a face of the B ring and the P face of the C ring of the Ei (Brzozowski. et ah. 
supra and Tanenbaum, et al.„ supra ). The ethyl groups of DES, which project perpendicularly 
from the plane of the phenolic rings, fit snugly into these spaces. The resulting additional 

15 nonpolar contacts with the side chains of Ala 350, Leu 384. Phe 404, and Leu 428 (Figure 4 A) 
may account for the higher affinity of DES for the receptor (Kuiper, et ah, Endocrinology 
138:863-70 (1997)). 

Except for Met 421 and Met 528 (both of which contact only DES) and Met 388 and He 
424 (both of which contact only E 2 ), the ER is able to use the same residues to form all of the 

20 observed hydrogen bonds and van der Waals contacts with both agonists (Figure 4A, 

Brzozowski. et al„ supra , and Tanenbaum, et al., supra ). This remarkable adaptability is 
presumably the result of both relatively large molecular volume of the binding pocket (-500A 3 in 
both complexes) and its apparent structural plasticity. In particular, at the DES A' ring/steroid D 
ring end of the binding pocket. Met 343, Met 421, His 524 and Met 528 adopt different packing 

25 configurations in response to each ligand (data not shown). This plasticity may be necessary to 
allow the receptor to recognize endogenous estrogens such as estrone and estriol. which differ 
structurally from E2 at the D ring. 

Example 6 
Structure of the OHT-LBD Complex 

30 The binding of OHT induces a conformation of the LBD that differs in both secondary 

and tertiary structural organization from that driven by DES binding. In the DES complex, the 

main chain from residues 339 to 341, 421 to 423. and 527 to 530 form pans of helices 3. 8 and 
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1 1 respectively. In contrast, these regions adopt an extended conformation in the OHT complex 
(Figures 2A. 2B and 6A). In addition, the composition and orientation of helix 12 are different 
in the two structures. Helix 12 in the DES complex consists of residues 538 to 546 whereas 
helix 12 in the OHT complex consists of residues 536 to 544. Most dramatically, rather than 
5 covering the ligand binding pocket as it does in the DES complex, helix 12 in the OHT complex 
occupies the part of the coactivator binding groove formed by residues from helices 3, 4. and 5, 
and the turn connecting helices 3 and 4 (Figures 2A, 2B and 3B). This alternative conformation 
of helix 12 appears to be similar to that observed in the RAL complex (Brzozowski, et al., 
supra ). 

1 0 Example 7 

Helix 12-LBD Interface 

Except for the orientation of helix 12. the structure of the peptide binding groove is 

almost identical in the DES and OHT complexes (Figures 3A and 3B). The region of this groove 

outside of helix 12 is referred to herein as the "static region" of the NR box binding site. Helix 

15 12 in the OHT complex and the NR box peptide helix in the DES complex interact with the static 
region of the coactivator recognition groove in strikingly similar ways. 

Helix 12 mimics the hydrophobic interactions of the NR box peptide with the static 
region of the groove with a stretch of residues (residues 540 to 544) that resembles an NR box 
(LLEML instead of LXXLL) (SEQ ID NO:3 instead of SEQ ID NO:l). The side chains of Leu 

20 540 and Met 543 lie in approximately the same locations as those of the first and second motif 
leucines (Leu 690 and Leu 693) in the peptide complex (Figure 5). Leu 540 is inserted into the 
groove and makes van der Waals contacts with Leu 354, Val 376 and Glu 380 (Figures 3B and 
3D). Met 543 lies along the edge of the groove and forms van der Waals contacts with the side 
chains of Leu 354, Val 355 and He 358 (Figures 3B and 3D). The side chain position of Leu 544 

25 almost exactly overlaps that of the third NR box leucine, Leu 694 (Figure 5). Deep within the 

groove, the Leu 544 side chain makes van der Waals contacts with the side chains of He 358, Lys 
362. Leu 372. Gin 375, Val 376 and Leu 379 (Figures 3B and 3D). 

Helix 12 in the OHT complex is also stabilized by N- and C-terminal capping 
interactions. Lys 362 interacts with the C-terminal turn of helix 12 much as it does with the 

30 equivalent turn of the peptide helix (Figures 3A and 3B). The Lys 362 side chain packs against 

the C -terminal turn of the helix 12 with its e-amino group hydrogen bonding to the carbonyls of 

residues 543 and 544 (Figure 5). Given that the capping interaction at the N-terminal turn 

coactivator helix is formed by a helix 12 residue (Glu 542). the N-terminal turn of helix 12 in the 
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antagonist complex is forced to interact with another residue. Glu 380 (Figures 3B and 3D), The 

Glu 380 y-carboxylate forms van der Waals contacts with Tyr 537 and interacts with the amide 

of Tyr 537 through a scries of water-mediated hydrogen bonds (Figure IB). 

In addition to forming these "NR box-like" interactions, helix 12 also forms van der 

5 Waals contacts with areas of the LBD outside of the coactivator recognition groove. The side 

chain of Leu 536 forms van der Waals contacts with Glu 380 and Trp 383 and that of Tyr 537 

forms van der Waals contacts with His 373, Val 376 and Glu 380 (Figures IB, 3B and 3D). As a 

result of these contacts, helix 12 in the OHT complex buries more solvent accessible surface area 

(-I200A 2 ) than the NR box peptide in the DES complex. 

10 Example 8 

OHT Recognition 

OHT is bound within the same pocket that recognizes DES, E2 and RAL. The orientation 
of OHT within the binding pocket appears to be dictated by the positioning of two structural 
features of this ligand, the phenolic A ring and the bulky side chain (Figures 4B and 6C). The A 

1 5 ring of OHT is bound in approximately the same location as the A ring of DES near helices 3 
and 6 with its phenolic hydroxy 1 hydrogen bonding to a structurally conserved water and to the 
side chains of Glu 353 and Arg 394 (Figure 4B). Like the bulky side chain of RAL. the side 
* chain of OHT exits the binding pocket between helices 3 and 1 1 (Figures 2B and 4B). The OHT 
C ring (Figure 4B) forms van der Waals contacts with the side chains of Met 343, Leu 346, Thr 

20 347, Ala 350, Trp 383. Leu 384, Leu 387 and Leu 525. The positioning of the flexible 

dimethylaminoethyl region of the side chain is stabilized by van der Waals contacts with Thr 
347, Ala 350 and Trp 383 and by a salt-bridge between the dimethylamino group of the side 
chain and the P-carboxylate of Asp 351, which lies 3.8A away (Figure 4B). The positions of the 
A ring and the side chain in the context of the rigid triphenylethylene framework of OHT 

25 requires that the ethylene group of OHT lie in an orientation nearly orthogonal to that of the 

ethylene group of DES (Figures 4A, 4B and 6D). As a result, the B ring of OHT is driven more 
deeply into the binding pocket than the A 1 ring of DES (Figures 6B and 6C). 

This location of the OHT B ring apparently cannot be accommodated by the same 
mechanisms that allow the DES A 1 ring/E: D ring end of the binding pocket to adapt to the 

30 different structural features of DES and E 2 . Instead, the residues that contact the B ring (Met 

343, Leu 346. Met 421. He 424, Gly 52 L His 524 and Leu 525). most of which also interact with 

the A' ring of DES. adopt conformations distinct from the ones they adopt in the DES structure 

(Figure 6D). In fact, the location of the B ring actually precludes the side chain of one residue. 
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Met 421 . from adopting the same conformation that it adopts in the DES structure (Figures 6B 
and 6C). As a consequence of these B ring-induced side chain conformations, many interresidue 
van der Waals contacts present in the DES complex are absent in the OHT complex. For 
example, whereas Met 421 packs against His 524 from helix 1 1 and against Met 343 from helix 
5 3 in the agonist complexes, it is precluded by the location of the OI IT B ring from interacting 
with either of these residues in the antagonist complex (Figure 6D). 

The structural effects of the placement of the B ring are not limited to the residues that 
contact the B ring; the conformations of these residues force other residues throughout the 
binding pocket to, in turn, adopt alternative conformations. For instance, the conformation 

1 0 adopted by Met 421 in the OHT complex prevents the side chains of Phe 404 and Phe 425 from 
occupying the positions they take in the DES complex (Figure 6B and 6C). As a consequence, 
Phe 404 does not make van der Waals contacts with the OHT A ring as it does with the A rings 
of DES or E 2 (Figure 6C). In fact, Phe 404 only contacts the ethyl group of OHT (Figures 6C 
and 6D). The alternative conformations of the side chains of both the residues that directly 

1 5 contact the B ring and those that arc indirectly affected by it, force the main chain throughout the 
binding pocket to adopt a different conformation as well (Figure 6D). 

Identification and characterization of key residues within ligand binding domain of the 
ERa and extension of this information to other nuclear receptors shows that these residues are 
common for all nuclear receptors identified to date. Thus, the Examples presented herein 

20 demonstrate that information derived from the structure and function of the ERa ligand binding 
domain can be applied in design and selection of compounds that modulate binding of 
compounds to nuclear receptors for all members of the nuclear receptor family. 

All publications and patent applications mentioned in this specification are herein 
incorporated by reference to the same extent as if each individual publication or patent 

25 application was specifically and individually indicated to be incorporated by reference. 

The invention now being fully described, it will be apparent to one of ordinary skill in the 
an that many changes and modifications can be made thereto without departing from the spirit or 
scope of the appended claims. 
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Appendix 1 

Atomic Coordinates for Human ERa Complexed With DES and a GRIP1 NR Box II Peptide 
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Appendix 2 

Atomic Coordinates for Human ERtx Complexed With OHT 
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ATOM 


439 


CD 


ARG 


363 


11.468 


-1.086 


13.746 


1.00 


42.17 




ATOM 


440 


NE 


ARG 


363 


10.161 


-0.586 


14.158 


1.00 


45.19 




ATOM 


441 


CZ 


ARG 


363 


9.314 


-1.262 


14.929 


1.00 


49.41 




ATOM 


442 


NH1 


ARG 


363 


9.642 


-2.467 


15.374 


1.00 


48.02 




ATOM 


443 


NH2 


ARG 


363 


8.143 


-0.729 


15.261 


1.00 


51.54 


25 


ATOM 


444 


C 


ARG 


363 


12.654 


3.743 


12.943 


1.00 


37.40 




ATOM 


445 


O 


ARG 


363 


1 1 .567 


4.199 


13.303 


1.00 


38.22 




ATOM 


446 


N 


VAL 


364 


13.785 


4.442 


13.002 


1.00 


35.66 




ATOM 


447 


CA 


VAL 


364 


13.804 


5.836 


13.431 


1.00 


34.06 




ATOM 


448 


CB 


VAL 


364 


15.231 


6.271 


13.827 


1.00 


33.87 


30 


ATOM 


449 


CGI 


VAL 


364 


15.293 


7.779 


13.995 


1.00 


31.08 




ATOM 


450 


CG2 


VAL 


364 


15.641 


5.571 


15.113 


1.00 


31.30 




ATOM 


451 


C 


VAL 


364 


13.360 


6.591 


12.171 


1.00 


33.19 




ATOM 


452 


O 


VAL 


364 


14.028 


6.531 


11.146 


1.00 


33.04 




ATOM 


453 


N 


PRO 


365 


12.225 


7.310 


12.234 


1.00 


34.69 


35 


ATOM 


454 


CD 


PRO 


365 


1 1.359 


7.492 


13.413 


1.00 


34.19 




ATOM 


455 


CA 


PRO 


365 


1 1.724 


8.050 


1 1 .069 


1.00 


35.96 




ATOM 


456 


CB 


PRO 


365 


10.608 


8.918 


1 1.645 


1.00 


36.59 




ATOM 


457 


CG 


PRO 


365 


10.135 


8.157 


12.842 


1.00 


39.59 




ATOM 


458 


C 


PRO 


365 


12.756 


8.878 


10.321 


1.00 


37.19 


40 


ATOM 


459 


O 


PRO 


365 


13.430 


9.726 


10.907 


1.00 


40.29 




ATOM 


460 


N 


GLY 


366 


12.878 


8.624 


9.023 


1.00 


34.78 




ATOM 


461 


CA 


GLY 


366 


13.816 


9.371 


8.212 


1.00 


33.54 




ATOM 


462 


C 


GLY 


366 


15.168 


8.722 


8.007 


1.00 


34.26 




ATOM 


463 


O 


GLY 


366 


15.858 


9.035 


7.034 


1.00 


37.15 


45 


ATOM 


464 


N 


PHE 


367 


15.554 


7.814 


8.901 


1.00 


j j.l j 




ATOM 


465 


CA 


PHE 


367 


16.860 


7.164 


8.787 


1.00 


32.04 




ATOM 


466 


CB 


PHE 


367 


17.138 


6.291 


10.016 


1.00 


30.22 




ATOM 


467 


CG 


PHE 


367 


18.544 


5.773 


10.080 


1.00 


30.60 
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ATOM 


468 


CD1 


PHE 


367 


18.827 


4.446 


9.751 


1.00 


31.94 




ATOM 


469 


CD2 


PHE 


367 


19.589 


6.601 


10.485 


1.00 


29.20 




ATOM 


470 


CE1 


PHE 


367 


20.133 


3.950 


9.828 


1.00 


28.30 




ATOM 


471 


CE2 


PHE 


367 


20.896 


6.122 


10.568 


1.00 


28.12 


5 


ATOM 


472 


CZ 


PHE 


367 


21.171 


4.791 


10.240 


1.00 


25.41 




ATOM 


473 


C 


PHE 


367 


17.033 


6.333 


7.524 


1.00 


31.46 




ATOM 


474 


O 


PHE 


367 


18.073 


6.405 


6.883 


1.00 


32.30 




ATOM 


475 


N 


VAL 


368 


16.027 


5.541 


7.165 


1.00 


35.20 




ATOM 


476 


CA 


VAL 


368 


16.123 


4.718 


5.959 


1.00 


38.98 


10 


ATOM 


477 


CB 


VAL 


368 


15.076 


3.584 


5.945 


1.00 


40.61 




ATOM 


478 


CGI 


VAL 


368 


15.543 


2.447 


6.843 


1.00 


41.48 




ATOM 


479 


CG2 


VAL 


368 


13.717 


4.113 


6.390 


1.00 


41.60 




ATOM 


480 


c 


VAL 


368 


15.965 


5.523 


4.673 


1.00 


40.06 




ATOM 


481 


o 


VAL 


368 


16.156 


4.992 


3.579 


1.00 


41.66 


15 


ATOM 


482 


N 


ASP 


369 


15.608 


6.798 


4.798 


1.00 


38.65 




ATOM 


483 


CA 


ASP 


369 


15.465 


7.646 


3.621 


1.00 


37.15 




ATOM 


484 


CB 


ASP 


369 


14.700 


8.929 


3.954 


1.00 


39.89 




ATOM 


485 


CG 


ASP 


369 


1 3.254 


8.671 


4.302 


1.00 


45.59 




ATOM 


486 


OD1 


ASP 


369 


12.686 


7.672 


3.806 


1.00 


46.34 


20 


ATOM 


487 


OD2 


ASP 


369 


12.681 


9.472 


5.074 


1.00 


49.13 




ATOM 


488 


c 


ASP 


369 


16.855 


8.010 


3.136 


1.00 


34.91 




ATOM 


489 


o 


ASP 


369 


1 7.038 


8.431 


1.995 


1.00 


34.25 




ATOM 


490 


N 


LEU 


370 


17.838 


7.841 


4.016 


1.00 


31.76 




ATOM 


491 


CA 


LEU 


370 


19.229 


8.153 


3.705 


1.00 


28.08 


25 


ATOM 


492 


CB 


LEU 


370 


20.020 


8.339 


5.003 


1.00 


28.81 




ATOM 


493 


CG 


LEU 


370 


19.523 


9.395 


6.000 


1.00 


28.74 




ATOM 


494 


CD1 


LEU 


370 


20.315 


9.275 


7.299 


1.00 


30.81 




ATOM 


495 


CD2 


LEU 


370 


19.693 


10.792 


5.404 


1.00 


29.77 




ATOM 


496 


c 


LEU 


370 


19.884 


7.043 


2.893 


1.00 


31.25 


30 


ATOM 


497 


o 


LEU 


370 


19.341 


5.943 


2.784 


1.00 


31.78 




ATOM 


498 


N 


THR 


371 


21.052 


7.333 


2.331 


1.00 


28.86 




ATOM 


499 


CA 


THR 


371 


21.793 


6.336 


1.569 


1.00 


32.90 




ATOM 


500 


CB 


THR 


371 


22.979 


6.944 


0.818 


1.00 


33.44 




ATOM 


501 


OGI 


THR 


371 


23.880 


7.523 


1.766 


1.00 


34.59 


35 


ATOM 


502 


CG2 


THR 


371 


22.514 


8.002 


-0.178 


1.00 


32.63 




ATOM 


503 


C 


THR 


371 


22.373 


5.315 


2.539 


1.00 


35.31 




ATOM 


504 


o 


THR 


371 


22.536 


5.591 


3.733 


1.00 


31.27 




ATOM 


505 


N 


LEU 


372 


22.702 


4.141 


2.015 


1.00 


34.34 




ATOM 


506 


CA 


LEU 


372 


23.273 


3.073 


2.822 


1.00 


35.46 


40 


ATOM 


507 


CB 


LEU 


372 


23.518 


1.841 


1.944 


1.00 


37.73 




ATOM 


508 


CG 


LEU 


372 


24.362 


0.704 


2.515 


1.00 


42.43 




ATOM 


509 


CD1 


LEU 


372 


23.690 


0.145 


3.757 


1.00 


45.60 




ATOM 


510 


CD2 


LEU 


372 


24.534 


-0.383 


1.455 


1.00 


44.29 




ATOM 


511 


C 


LEU 


372 


24.587 


3.548 


3.444 


1.00 


36.95 


45 


ATOM 


512 


O 


LEU 


372 


24.813 


3.374 


4.643 


1.00 


35.57 




ATOM 


513 


N 


HIS 


373 


25.442 


4.159 


2.627 


1.00 


35.68 




ATOM 


514 


CA 


HIS 


373 


26.729 


4.656 


3.099 


1.00 


36.60 




ATOM 


515 


CB 


HIS 


373 


27.506 


5.282 


1.935 


1.00 


44.01 
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10 



15 



20 



25 



30 



35 



40 



45 



ATOM 


516 


CG 


HIS 


373 


28.538 


6.1 > . 


2.360 


1.00 


50.69 


ATOM 


517 


CD2 


HIS 


373 


29.857 


6.i 1-8 


2.636 


1.00 


54.69 


ATOM 


518 


ND1 


HIS 


373 


28.246 


7.613 


2.561 


1.00 


53.77 


ATOM 


519 


CE1 


HIS 


373 


29.339 


8.248 


2.945 


1.00 


57.09 


ATOM 


520 


NE2 


HIS 


373 


30.331 


7.376 


2.999 


1.00 


57.23 


ATOM 


521 


C 


HIS 


373 


26.575 


5.669 


4.244 


1.00 


36.22 


ATOM 


522 


O 


HIS 


373 


27.350 


5.650 


5.201 


1.00 


33.05 


ATOM 


523 


N 


ASP 


374 


25.580 


6.549 


4.148 


1.00 


32.03 


ATOM 


524 


CA 


ASP 


374 


25.342 


7.541 


5.196 


1.00 


30.76 


ATOM 


525 


CB 


ASP 


374 


24.354 


8.603 


4.713 


1.00 


30.12 


ATOM 


526 


CG 


ASP 


374 


25.018 


9.672 


3.860 


1.00 


35.83 


ATOM 


527 


OD1 


ASP 


374 


26.264 


9.744 


3.842 


1.00 


34.39 


ATOM 


528 


OD2 


ASP 


374 


24.291 


10.440 


3.199 


1.00 


35.39 


ATOM 


529 


C 


ASP 


374 


24.805 


6.876 


6.472 


1.00 


30.33 


ATOM 


530 


O 


ASP 


374 


25.152 


7.275 


7.587 


1.00 


27.04 


ATOM 


531 


N 


GLN 


375 


23.944 


5.877 


6.309 


1.00 


25.71 


ATOM 


532 


CA 


GLN 


375 


23.403 


5.157 


7.454 


1.00 


26.68 


ATOM 


533 


CB 


GLN 


375 


22.424 


4.077 


6.993 


1.00 


29.70 


ATOM 


534 


CG 


GLN 


375 


21.101 


4.616 


6.484 


1.00 


29.16 


ATOM 


535 


CD 


GLN 


375 


20.219 


3.514 


5.940 


1.00 


35.87 


ATOM 


536 


OE1 


GLN 


375 


20.155 


2.426 


6.510 


1.00 


30.97 


ATOM 


537 


NE2 


GLN 


375 


19.541 


3.785 


4.827 


1.00 


34.51 


ATOM 


538 


C 


GLN 


375 


24.556 


4.502 


8.214 


1.00 


25.51 


ATOM 


539 


O 


GLN 


375 


24.585 


4.513 


9.442 


1.00 


28.14 


ATOM 


540 


N 


VAL 


376 


25.504 


3.938 


7.475 


1.00 


26.62 


ATOM 


. 541 


CA 


VAL 


376 


26.659 


3.281 


8.071 


1.00 


29.24 


ATOM 


542 


CB 


VAL 


376 


27.531 


2.597 


7.003 


1.00 


29.66 


ATOM 


543 


CGI 


VAL 


376 


28.812 


2.071 


7.635 


1.00 


28.29 


ATOM 


544 


CG2 


VAL 


376 


26.745 


1.469 


6.341 


1.00 


29.90 


ATOM 


545 


C 


VAL 


376 


27.526 


4.285 


8.821 


1.00 


30.87 


ATOM 


546 


O 


VAL 


376 


27.953 


4.029 


9.948 


1.00 


30.09 


ATOM 


—547— 


— N 


HIS 


377 


27.785 


— 5.428 


8.191 


1.00 


28.05 






















ATOM 


548 


CA 


HIS 


377 


28.602 


6.457 


8.814 


1.00 


28.68 


ATOM 


549 


CB 


HIS 


377 


28.792 


7.639 


7.864 


1.00 


30.26 


ATOM 


550 


CG 


HIS 


377 


29.508 


8.791 


8.488 


1. 00 


33.89 


ATOM 


551 


CD2 


HIS 


377 


29.073 


10.017 


8.863 


1.00 


34.99 


ATOM 


552 


ND1 


HIS 


377 


30.846 


8.740 


8.823 


1.00 


37.01 


ATOM 


553 


CE1 


HIS 


377 


31.201 


9.884 


9.377 


1.00 


34.79 


ATOM 


554 


NE2 


HIS 


377 


30.144 


10.677 


9.413 


1.00 


34.95 


ATOM 


555 


C 


HIS 


377 


27.983 


6.954 


10.114 


1.00 


25.13 


ATOM 


556 


O 


HIS 


377 


28.677 


7.102 


11.115 


1.00 


25.93 


ATOM 


557 


N 


LEU 


378 


26.678 


7.206 


10.107 


1.00 


24.58 


ATOM 


558 


CA 


LEU 


378 


26.015 


7.695 


11.315 


1.00 


26.40 


ATOM 


559 


CB 


LEU 


378 


24.542 


8.001 


11.027 


1.00 


26.29 


ATOM 


560 


CG 


LEU 


378 


24.291 


9.180 


10.073 


1. 00 


28.06 


ATOM 


561 


CD1 


LEU 


378 


22.778 


9.353 


9.869 


1.00 


27.66 


ATOM 


562 


CD2 


LEU 


378 


24.91 1 


10.458 


10.642 


1.00 


30.08 


ATOM 


563 


C 


LEU 


378 


26.120 


6.695 


12.459 


1.00 


28.55 
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ATOM 


564 


o 


LEU 


378 




ATOM 


565 


N 


LEU 


379 




ATOM 


566 


CA 


LEU 


379 




ATOM 


567 


CB 


LEU 


379 


5 


ATOM 


568 


CG 


LEU 


379 




ATOM 


569 


CD1 


LEU 


379 




ATOM 


570 


CD2 


LEU 


379 




ATOM 


571 


c 


LEU 


379 




ATOM 


572 


O 


LEU 


379 


10 

I \J 


ATOM 
rv 1 W ivi 


573 


N 


GLU 


380 




ATOM 


574 


CA 


GLU 


380 




ATOM 


575 


CB 


GLU 


380 




ATOM 


576 


CG 


GLU 


380 




ATOM 


577 


CD 


GLU 


380 


1 c 
1 3 




578 


OE1 


GLU 


380 




ATOM 


579 


OE2 


GLU 


380 




ATOM 


580 


r 


GLU 


380 




ATOM 


581 


o 


GLU 


380 




ATOM 


58? 


N 


ACYS 


381 


on 


A TOM 


58^ 


N 


BCYS 381 




ATOM 


584 


CA 


ACYS 381 




ATOM 


585 


CA 


BCYS 381 




ATOM 


586 


CR 


ACYS 381 




ATOM 


587 


CB 


BCYS 381 


is 


ATOM 


588 


SG 


ACYS 381 




ATOM 


589 

JO/ 


SG 


BCYS 381 




ATOM 


5Q0 


c 


ACYS 381 




ATOM 


591 


c 


BCYS 381 




ATOM 


59? 


o 

V-/ 


ACYS 381 




ATOM 


501 


o 


BCYS 381 




ATOM 


5Q4 


N 


ALA 


382 




ATOM 


595 


CA 


ALA 


382 




ATOM 


596 


CB 


ALA 


382 




ATOM 


597 


c 


ALA 


382 




ATOM 


598 


o 


ALA 


382 




ATOM 


599 


N 


TRP 


383 




ATOM 


600 


CA 


TRP 


383 




ATOM 


601 


CB 


TRP 


383 




ATOM 


602 


CG 


TRP 


383 


40 


ATOM 


603 


CD2 


TRP 


383 




ATOM 


604 


CE2 


TRP 


383 




ATOM 


605 


CE3 


TRP 


383 




ATOM 


606 


CD1 


TRP 


383 




ATOM 


607 


NE1 


TRP 


383 


45 


ATOM 


608 


CZ2 


TRP 


383 




ATOM 


609 


CZ3 


TRP 


383 




ATOM 


610 


CH2 


TRP 


383 




ATOM 


611 


C 


TRP 


383 



26.379 


7.075 


13.605 


1.00 


24.76 


25.919 


5.414 


12.153 


1.00 


24.29 


26.000 


4.388 


13.182 


1.00 


27.03 


25.401 


3.073 


12.667 


1.00 


28.53 


23.875 


3.023 


12.845 


1.00 


30.29 


23.248 


1.943 


1 1 .963 


1.00 


33.04 


23.563 


2.759 


14.312 


1.00 


29.45 


27.430 


4.176 


13.670 


1.00 


27.18 


27.653 


3.979 


14.866 


1.00 


25.95 


28.402 


4.236 


12.762 


1.00 


25.86 


29.786 


4.054 


13.173 


1.00 


27.58 


30.730 


4.036 


11.968 


1.00 


30.36 


32.172 


3.785 


12.380 


1.00 


37.98 


33.080 


3.471 


11.210 


1.00 


45.23 


32.869 


4.048 


10.120 


1.00 


42.99 


34.004 


2.646 


11.386 


1.00 


45.79 


30.218 


5.159 


14.133 


1.00 


27.50 


31.056 


4.937 


15.010 


1.00 


26.67 


29.637 


6.339 


13.965 


0.75 


24.89 


29.645 


6.352 


13.980 


0.25 


25.79 


29.969 


7.466 


14.826 


0.75 


24.12 


29.993 


7.481 


14.847 


0.25 


24.86 


29.621 


8.781 


14.122 


0.75 


25.96 


29.766 


8.814 


14.115 


0.25 


25.62 


30.698 


9.192 


12.732 


0.75 


31.63 


30.227 


10.312 


15.059 


0.25 


25.40 


29.237 


7.422 


16.162 


0.75 


22.07 


29.21 1 


7.498 


16.159 


0.25 


23.97 


29.812 


7.730 


17.206 


0.75 


21.97 


29.724 


7.940 


17.187 


0.25 


23.99 


27.974 


7.012 


16.128 


1.00 


23.41 


27.140 


7.015 


17.318 


1.00 


22.83 


25.785 


7.587 


16.948 


1.00 


25.50 


26.913 


5.755 


18.131 


1.00 


25.39 


26.374 


5.837 


19.234 


1.00 


23.09 


27.311 


4.602 


17.615 


1.00 


25.98 


27.026 


3.354 


18.318 


1.00 


23.80 


27.669 


2.172 


17.580 


1.00 


22.52 


29.130 


2.054 


17.762 


1.00 


24.42 


29.797 


1.347 


18.803 


1.00 


27.31 


31.182 


1.484 


18.579 


1.00 


28.24 


29.360 


0.609 


19.912 


1.00 


27.37 


30.102 


2.578 


16.965 


1.00 


24.58 


31.342 


2.239 


17.446 


1.00 


27.35 


32.133 


0.909 


19.420 


1.00 


28.76 


30.305 


0.039 


20.745 


1.00 


28.09 


31.674 


0.191 


20.496 


1.00 


29.77 


27.356 


3.309 


19.802 


1.00 


23.54 
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ATOM 


612 


o 


TRP 


383 


26.526 


2.866 


20.584 


1.00 


22.90 




ATOM 


613 


N 


LEU 


384 


28.542 


3.765 


20.21 1 


1.00 


20.37 




ATOM 


614 


CA 


LEU 


384 


28.864 


3.713 


21.640 


1.00 


22.41 




ATOM 


615 


CB 


LEU 


384 


30.369 


3.890 


21.883 


1.00 


24.98 


5 


ATOM 


616 


CG 


LEU 


384 


30.824 


3.645 


23.336 


1.00 


27.33 




ATOM 


617 


CD1 


LEU 


384 


30.273 
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1874 


CB 


LEU 


541 


28.591 


-2.152 


5.145 


1.00 


39.67 




ATOM 


1875 


CG 


LEU 


541 


28.301 


-2.112 


3.643 


1.00 


40.92 




ATOM 


1876 


CD1 


LEU 


541 


27.184 


-1.130 


3.348 


1.00 


42.44 




ATOM 


1877 


CD2 


LEU 


541 


29.572 


-1.716 


2.908 


1.00 


44.18 




ATOM 


1878 


C 


LEU 


541 


26.723 


-3.676 


5.833 


1.00 


39.75 


20 


ATOM 


1879 


O 


LEU 


541 


25.616 


-3.713 


5.297 


1.00 


36.48 




ATOM 


1880 


N 


GLU 


542 


27.366 


-4.770 


6.230 


1.00 


40.88 




ATOM 


1881 


CA 


GLU 


542 


26.790 


-6.097 


6.037 


1.00 


41.89 




ATOM 


1882 


CB 


GLU 


542 


27.719 


-7.170 


6.620 


1.00 


44.11 




ATOM 


1883 


CG 


GLU 


542 


27.010 


-8.457 


7.052 


1.00 


50.60 


25 


ATOM 


1884 


CD 


GLU 


542 


26.434 


-9.245 


5.887 


1.00 


55.80 




ATOM 


1885 


OE1 


GLU 


542 


25.570 


-10.117 


6.130 


1.00 


58.81 




ATOM 


1886 


OE2 


GLU 


542 


26.842 


-8.996 


4.728 


1.00 


57.19 




ATOM 


1887 


C 


GLU 


542 


25.414 


-6.195 


6.691 


1.00 


41.58 




ATOM 


1888 


o 


GLU 


542 


24.472 


-6.720 


6.102 


1.00 


42.82 


30 


ATOM 


1889 


N 


MET 


543 


25.298 


-5.686 


7.915 


1.00 


40.09 




ATOM 


1890 


CA 


MET 


543 


24.036 


-5.731 


8.634 


1.00 


36.43 




ATOM 


1891 


CB 


MET 


543 


24.270 


-5.424 


10.1 1 1 


1.00 


39.95 




ATOM 


1892 


CG 


MET 


543 


25.137 


-6.459 


1 0.808 


1.00 


41.95 




ATOM 


1893 


SD 


MET 


543 


24.918 


-6.445 


12.604 


1.00 


47.17 


35 


ATOM 


1894 


CE 


MET 


543 


25.324 


-4.749 


12.964 


1.00 


40.88 




ATOM 


1895 


C 


MET 


543 


23.001 


-4.769 


8.072 


1.00 


35.02 




ATOM 


1896 


o 


MET 


543 


21.808 


-3.073 


O Z\ A O 

8.048 


1.00 


35.31 




ATOM 


1897 


N 


LEU 


r a A 

544 


23.457 


-3.605 


7.629 


1.00 


j2.90 




ATOM 


1 or\o 

1898 


CA 


LEU 


C A A 

544 


22.559 


*^ z* z\ o 

-2.603 


7.074 


1 f\f\ 

1.00 


j6.88 


A f\ 

40 


ATOM 


1 899 


CB 


LEU 


C A A 

544 


23.225 


1 ZT 

-1.226 


7.111 


1 f\f\ 

1.00 


j4.51 




ATOM 


1 AAA 

1900 


CG 


LEU 


£ A A 

544 


2j.268 


-0.562 


O A f\f\ 

8.490 


1.00 


j 1.94 




ATOM 


1901 


CD1 


LEU 


544 


24.284 


0.564 


8.478 


1.00 


32.27 




ATOM 


1902 


CD2 


LEU 


544 


21.897 


-0.029 


8.846 


1.00 


29.02 




ATOM 


1903 


C 


LEU 


544 


22.148 


-2.941 


5.640 


1.00 


38.94 


45 


ATOM 


1904 


O 


LEU 


544 


20.971 


-2.842 


5.294 


1.00 


39.52 




ATOM 


1905 


N 


ASP 


545 


23.1 18 


-3.338 


4.817 


1.00 


41.05 




ATOM 


1906 


CA 


ASP 


545 


22.850 


-3.685 


3.418 


1.00 


40.78 




ATOM 


1907 


CB 


ASP 


545 


24.159 


-3.780 


2.620 


1.00 


37.75 
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ATOM 


1908 


CG 


ASP 


545 


23.922 


-3.937 


1.120 


1.00 


35.19 




ATOM 


1909 


OD1 


ASP 


545 


24.881 


-4.265 


0.380 


1.00 


33.48 




ATOM 


1910 


OD2 


ASP 


545 


22.768 


-3.734 


0.691 


1.00 


31.33 




ATOM 


1911 


c 


ASP 


545 


22.116 


-5.015 


3.349 


1.00 


42.87 


5 


ATOM 


1912 


O 


ASP 


545 


22.681 


-6.030 


2.929 


1.00 


44.32 




ATOM 


1913 


N 


ALA 


546 


20.853 


-5.009 


3.755 


1.00 


43.49 




ATOM 


1914 


CA 


ALA 


546 


20.069 


-6.229 


3.746 


1.00 


46.96 




ATOM 


1915 


CB 


ALA 


546 


19.213 


-6.305 


5.006 


1.00 


47.82 




ATOM 


1916 


c 


ALA 


546 


19.193 


-6.362 


2.508 


1.00 


49.55 


10 


ATOM 


1917 


o 


ALA 


546 


18.804 


-5.368 


1.883 


1.00 


48.75 




ATOM 


1918 


N 


HIS 


547 


18.895 


-7.606 


2.152 


1.00 


50.98 




ATOM 


1919 


CA 


HIS 


547 


18.042 


-7.884 


1.006 


1.00 


53.77 




ATOM 


1920 


CB 


HIS 


547 


18.431 


-9.223 


0.369 


1.00 


52.69 




ATOM 


1921 


CG 


HIS 


547 


18.395 


-10.382 


1.317 


1. 00 


55.05 


15 


ATOM 


1922 


CD2 


HIS 


547 


17.477 


-10.752 


2.242 


1.00 


53.94 




ATOM 


1923 


ND1 


HIS 


547 


19.395 


-11.329 


1.371 


1.00 


56.23 




ATOM 


1924 


CE1 


HIS 


547 


19.095 


-12.232 


2.286 


1.00 


55.36 




ATOM 


1925 


NE2 


HIS 


547 


17.936 


-11.906 


2.830 


1.00 


57.01 




ATOM 


1926 


c 


HIS 


547 


16.603 


-7.936 


1.518 


1.00 


55.69 


20 


ATOM 


1927 


o 


HIS 


547 


16.362 


-7.796 


2.720 


1.00 


54.30 




ATOM 


1928 


N 


ARG 


548 


15.653 


-8.139 


0.612 


1.00 


57.00 




ATOM 


1929 


CA 


ARG 


548 


14.245 


-8.212 


0.987 


1.00 


60.65 




ATOM 


1930 


CB 


ARG 


548 


13.432 


-7.171 


0.208 


1.00 


62.69 




ATOM 


1931 


CG 


ARG 


548 


14.272 


-6.222 


-0.637 


1.00 


67.54 


25 


ATOM 


1932 


CD 


ARG 


548 


13.448 


-5.061 


-1.171 


1.00 


71.92 




ATOM 


1933 


NE 


ARG 


548 


13.702 


-3.826 


-0.432 


1.00 


76.95 




ATOM 


1934 


CZ 


ARG 


548 


14.864 


-3.178 


-0.429 


1.00 


79.04 




ATOM 


1935 


NH1 


ARG 


548 


15.891 


-3.644 


-1.128 


1.00 


80.66 




ATOM 


1936 


NH2 


ARG 


548 


15.001 


-2.063 


0.278 


1.00 


80.39 


30 


ATOM 


1937 


c 


ARG 


548 


13.695 


-9.608 


0.711 


1.00 


61.65 




ATOM 


1938 


o 


ARG 


548 


12.500 


-9.781 


0.466 


1.00 


62.05 




ATOM 


1939 


N 


LEU 


549 


14.576 


-10.603 


0.756 


1.00 


62.39 




ATOM 


1940 


CA 


LEU 


549 


14.188 


-11.985 


0.507 


1.00 


64.02 




ATOM 


1941 


CB 


LEU 


549 


15.433 


-12.828 


0.195 


1.00 


62.14 


35 


ATOM 


1942 


CG 


LEU 


549 


16.461 


-12.191 


-0.753 


1.00 


60.76 




ATOM 


1943 


CD1 


LEU 


549 


17.699 


-13.074 


-0.878 


1.00 


57.77 




ATOM 


1944 


CD2 


LEU 


549 


15.823 


-11.972 


-2.108 


1.00 


58.38 




ATOM 


1945 


c 


LEU 


549 


13.431 


-12.574 


1.702 


1.00 


66.65 




ATOM 


1946 


o 


LEU 


549 


12.759 


-13.600 


1.577 


1.00 


67.15 


40 


ATOM 


1947 


N 


HIS 


550 


13.541 


-11.920 


2.856 


1.00 


67.72 




ATOM 


1948 


CA 


HIS 


550 


12.858 


-12.378 


4.065 


1.00 


69.93 




ATOM 


1949 


CB 


HIS 


550 


13.753 


-12.190 


5.298 


1.00 


70.76 




ATOM 


1950 


CG 


HIS 


550 


14.977 


-13.054 


5.306 


1.00 


71.50 




ATOM 


1951 


CD2 


HIS 


550 


15.539 


-13.821 


4.341 


1.00 


71.63 


45 


ATOM 


1952 


ND1 


HIS 


550 


15.793 


-13.172 


6.411 


1.00 


71.98 




ATOM 


1953 


CE1 


HIS 


550 


16.805 


-13.972 


6.126 


1.00 


72.04 




ATOM 


1954 


NE2 


HIS 


550 


16.674 


-14.379 


4.876 


1.00 


71.39 




ATOM 


1955 


C 


HIS 


550 


11.556 


-11.603 


4.275 


1.00 


71.15 
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ATOM 


1956 


O 


HIS 


550 


10.940 


-1 1.684 


5.340 


1.00 


70.66 




ATOM 


1957 


N 


ALA 


551 


11.143 


-10.851 


3.258 


1.00 


72.22 




ATOM 


1958 


CA 


ALA 


551 


9.919 


-10.057 


3.338 


1.00 


73.58 




ATOM 


1959 


CB 


ALA 


551 


9.904 


-9.014 


2.221 


1. 00 


73.21 


5 


ATOM 


I960 


C 


ALA 


551 


8.658 


-10.920 


3.266 


1.00 


74.69 




ATOM 


1961 


O 


ALA 


551 


7.684 


-10.474 


2.621 


1.00 


76.12 




ATOM 


1962 


OXT 


ALA 


551 


8.651 


-12.025 


3.852 


1.00 


73.79 




HETATM 


1963 


C10 


OHT 


600 


30.581 


1.481 


29.471 


1.00 


26.84 




HETATM 


1964 


C9 


OHT 


600 


30.713 


-0.043 


29.358 


1.00 


22.85 


10 


HETATM 


1965 


C8 


OHT 


600 


31.366 


-0.385 


28.037 


1.00 


25.56 




HETATM 


1966 


Cll 


OHT 


600 


32.761 


0.051 


27.916 


1.00 


27.51 




HETATM 


1967 


C16 


OHT 


600 


33.218 


0.797 


26.797 


1.00 


28.35 




HETATM 


1968 


C15 


OHT 


600 


34.551 


1.237 


26.747 


1.00 


30.39 




HETATM 


1969 


C14 


OHT 


600 


35.443 


0.923 


27.792 


1.00 


30.23 


15 


HETATM 


1970 


C13 


OHT 


600 


35.004 


0.185 


28.890 


1.00 


31.45 




HETATM 


1971 


C12 


OHT 


600 


33.666 


-0.241 


28.955 


1.00 


27.93 




HETATM 


1972 


C7 


OHT 


600 


30.682 


-1.089 


27.077 


1.00 


24.41 




HETATM 


1973 


CI 


OHT 


600 


29.21 1 


-1.258 


27.052 


1.00 


24.26 




HETATM 


1974 


C2 


OHT 


600 


28.644 


-2.526 


26.706 


1.00 


25.92 


20 


HETATM 


1975 


C3 


OHT 


600 


27.254 


-2.668 


26.580 


1.00 


26.32 




HETATM 


1976 


C4 


OHT 


600 


26.438 


-1.553 


26.813 


1.00 


29.02 




HETATM 


1977 


04 


OHT 


600 


25.072 


-1.605 


26.716 


1.00 


28.42 




HETATM 


1978 


C5 


OHT 


600 


26.980 


-0.286 


27.130 


1.00 


26.98 




HETATM 


1979 


C6 


OHT 


600 


28.362 


-0.147 


27.231 


1.00 


25.23 


25 


HETATM 


1980 


C17 


OHT 


600 


31.370 


-1.692 


25.942 


1.00 


26.61 




HETATM 


1981 


C18 


OHT 


600 


32.508 


-2.498 


26.151 


1.00 


26.77 




HETATM 


1982 


C19 


OHT 


600 


33.166 


-3.052 


25.072 


1.00 


27.50 




HETATM 


1983 


C20 


OHT 


600 


32.676 


-2.794 


23.786 


1.00 


27.50 




HETATM 


1984 


O20 


OHT 


600 


33.206 


-3.566 


22.795 


1.00 


31.35 


30 


HETATM 


1985 


C23 


OHT 


600 


33.009 


-3.135 


21.448 


1.00 


40.09 




HETATM 


1986 


C24 


OHT 


600 


34.226 


-3.490 


20.575 


1.00 


44.80 




HETATM 


1987 


N24 


OHT 


600 


34.141 


-4.901 


20.203 


1.00 


49.00 




HETATM 


1988 


C25 


OHT 


600 


33.375 


-5.040 


18.933 


1.00 


51.64 




HETATM 


1989 


C26 


OHT 


600 


35.495 


-5.459 


20.004 


1.00 


52.06 


35 


HETATM 


1990 


C21 


OHT 


600 


31.540 


-2.005 


23.558 


1.00 


27.19 




HETATM 


1991 


C22 


OHT 


600 


30.892 


-1.450 


24.645 


1.00 


27.92 




HETATM 


1992 


Ol 


HOH 


1 


20.714 


-12.010 


23.057 


1.00 


27.20 




HETATM 


1993 


Ol 


HOH 


2 


22.563 


-0.070 


25.819 


1.00 


25.77 




HETATM 


1994 


Ol 


HOH 


3 


25.183 


19.202 


23.149 


1.00 


42.52 


40 


HETATM 


1995 


Ol 


HOH 


4 


35.158 


5.823 


37.390 


1.00 


33.92 




HETATM 


1996 


Ol 


HOH 


5 


22.116 


-9.922 


18.914 


1.00 


30.18 




HETATM 


1997 


Ol 


HOH 


6 


29.812 


6.536 


19.652 


1.00 


26.1 1 




HETATM 


1998 


Ol 


HOH 


7 


13.362 


4.463 


20.376 


1.00 


29.40 




HETATM 


1999 


Ol 


HOH 


8 


19.799 


-11.295 


20.187 


1.00 


28.70 


45 


HETATM 


2000 


Ol 


HOH 


9 


21.205 


1.466 


23.794 


1.00 


22.47 




HETATM 


2001 


Ol 


HOH 


10 


21.177 


-4.961 


29.066 


1.00 


33.00 




HETATM 


2002 


Ol 


HOH 


11 


18.591 


1.863 


20.518 


1.00 


32.59 




HETATM 


2003 


Ol 


HOH 


12 


16.298 


21.566 


15.992 


1.00 


33.42 
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CLAIMS 

WHAT IS CLAIMED IS: 

1 . A method of identifying a compound that modulates nuclear receptor activity, said method 
comprising: 

modeling test compounds that fit spatially into a nuclear receptor ligand binding domain of 
interest using an atomic structural model of the estrogen receptor a ligand binding domain 
or portion thereof, 

screening said test compounds in an assay characterized by binding of a test compound to 
the ligand binding domain, and 

identifying a test compound that modulates nuclear receptor activity, wherein said atomic 
structural model comprises atomic coordinates of amino acid residues corresponding to 
residues of human estrogen receptor a Met343, Leu346, Ala350, Glu353, Leu384, Leu387, 
Leu391, Arg394, Phe404, Met42l, Leu428, Gly52L His524, Leu525 and Met528. 

2. The method of Claim 1 wherein the amino acid residues correspond to residues Met343, 
Met421, His524, Leu525 and Met528. 

3. The method of Claim 1 wherein the test compound is an agonist and nuclear receptor 
activity is measured by binding of a coactivator to the coactivator binding site. 

4. The method of Claim 1 wherein the test compound is an antagonist and nuclear receptor 
activity is measured by the unwinding of helix 12. 

5. The method of Claim 1 wherein the test compound is an antagonist and nuclear receptor 
activity is measured by the blocking of coactivator binding. 

6. The method of Claim 1 wherein said screening is in vitro. 

7. The method of Claim 6 wherein said screening is high throughput screening. 

8. The method of Claim 1 wherein said test compound is from a library of compounds. 

9. The method of Claim 1 wherein said test compound is a small organic molecule, a peptide, 
or peptidomimetic. 

10. The method of Claim 1 which further comprises the step of providing the atomic 
coordinates of the estrogen receptor a ligand binding domain or portion thereof to a 
computerized modeling system, prior to said modeling step. 

1 1 . The method of Claim 1 wherein said nuclear receptor is selected from the group consisting 
of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid receptors. 
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progestin receptors, mineralocorticoid receptors, androgen receptors, peroxisome receptors 
and vitamin D receptors. 

12. The method of Claim 1 1 wherein said nuclear receptor is an estrogen receptor. 

13. The method of Claim 12 wherein said estrogen receptor is the estrogen receptor a. 

14. A method of identifying a compound that modulates ligand binding to a nuclear receptor, 
said method comprising: 

modeling test compounds that fit spacially into a nuclear receptor ligand binding domain of 
interest using an atomic structural model of the estrogen receptor a ligand binding domain 
or portion thereof, 

screening said test compounds in an assay characterized by binding of a test compound to 
the binding domain, and 

identifying a test compound that modulates ligand binding to said nuclear receptor, 
wherein said atomic structural model comprises atomic coordinates of amino acid residues 
corresponding to residues of human estrogen receptor a Met343. Leu346, Ala350, Glu353, 
Leu384, Leu387, Lcu391, Arg394, Phe404, Met421, Leu428. Gly521, His524, Leu525 and 
Met528. 

15. The method of Claim 14 wherein the amino acid residues correspond to residues Met343, 
Met421, His524, Leu525 and Met528. 

16. The method of Claim 14 wherein said nuclear receptor is selected from the group 
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid 
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors, 
peroxisome receptors and vitamin D receptors. 

17. The method of Claim 16 wherein said nuclear receptor is an estrogen receptor. 

1 8. The method of Claim 1 7 wherein said estrogen receptor is the estrogen receptor a. 

19. The method of Claim 14 wherein said screening is in vitro. 

20. The method of Claim 19 wherein said screening is high throughput screening. 

21. The method of Claim 14 wherein said test compound is from a library of compounds. 

22. The method of Claim 14 wherein said test compound is an agonist or antagonist of ligand 
binding. 

23. The method of Claim 14 wherein said test compound is a small organic molecule, a 
peptide, or peptidomimetic. 
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24. A method for identifying an agonist or antagonist of ligand binding to a nuclear receptor, 
said method comprising the steps of: 

providing the atomic coordinates of the estrogen receptor a ligand binding domain or 
portion thereof to a computerized modeling system, wherein said atomic coordinates are of 
the amino acid residues corresponding to residues of human estrogen receptor a Met343. 
Leu346. AIa350. GIu353, Leu384. Leu387. Leu391, Arg394. Phe404, Met42L Leu428, 
Gly52 1 , His524, Leu525 and Met528; 

modeling compounds which fit spacially into the ligand binding domain: and 
identifying in an assay for nuclear receptor activity a compound which increases or 
decreases the activity of the nuclear receptor by binding the ligand binding domain of said 
nuclear receptor, whereby an agonist or antagonist of ligand binding is identified. 

25. The method of Claim 24 wherein the amino acid residues correspond to residues Met343, 
Met421. His524. Leu525 and Met528. 

26. The method of Claim 24 wherein said nuclear receptor is selected from the group 
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid 
receptors, progestin receptors, mineralocorticoid receptors; androgen receptors, 
peroxisome receptors and vitamin D receptors. 

27. The method of Claim 26 wherein said nuclear receptor is an estrogen receptor. 

28. The method of Claim 27 wherein said estrogen receptor is the estrogen receptor a. 

29. A method of modulating nuclear receptor activity in a mammal by administering to a 
mammal in need thereof a sufficient amount of a compound that fits spatially and 
preferentially into a ligand binding domain of a nuclear receptor of interest, where said 
compound is designed so as to distort at least one amino acid residue corresponding to 
residues of human estrogen receptor a Met343. Leu346, Ala350, Glu353. Lcu384, Leu387, 
Leu391, Arg394, Phe404, Met42l, Leu428. GIy521, His524, Leu525 and Met528. 

30. The method of Claim 29 wherein at least one amino acid residue corresponds to residues 
Met343. Met421, His524, Leu525 and Met528. 

3 1 . The method of Claim 29 wherein said nuclear receptor is selected from the group 
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid 
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors, 
peroxisome receptors and vitamin D receptors. 

32. The method of Claim 3 1 wherein said nuclear receptor is an estrogen receptor. 
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33. The method of Claim 32 wherein said estrogen receptor is the estrogen receptor a. 

34. A machine-readable data storage medium, comprising a data storage material encoded with 
machine readable data which, when using a machine programmed with instructions for 
using said data, is capable of displaying a graphical three-dimensional representation of a 

5 molecular complex of a compound bound to a nuclear receptor ligand binding domain 

comprising structure coordinates of amino acids corresponding to human estrogen receptor 
a Met343, Leu346, AIa350, Glu353, Leu384, Leu387. Leu391, Arg394, Phe404, Met421, 
Leu428, Gly521, His524, Leu525 and Met528 or a homologue of said molecular complex, 
wherein said homologue comprises a ligand binding domain that has a root mean square 
1 0 deviation from the backbone atoms of said amino acids of not more than 1 .5A. 

35. The machine-readable data storage medium of Claim 34 wherein the amino acid residues 
correspond to residues Mct343, Met421, His524, Leu525 and Met528. 

36. The machine-readable data storage medium of Claim 34 wherein said nuclear receptor is 
selected from the group consisting of estrogen receptors, thyroid receptors, retinoid 

15 receptors, glucocorticoid receptors, progestin receptors, mineralocorticoid receptors, 

androgen receptors, peroxisome receptors and vitamin D receptors. 

37. The machine-readable data storage medium of Claim 36 wherein said nuclear receptor is 
an estrogen receptor. 

38. The machine-readable data storage medium of Claim 37 wherein said estrogen receptor is 
20 the estrogen receptor a. 

39. The machine-readable data storage medium of Claim 34 wherein said molecular complex 
is defined by the set of structure coordinates depicted in Appendix 1 or Appendix 2, or a 
homologue of said molecular complex, said homologue having a root mean square 
deviation from the backbone atoms of said amino acids of not more than 1.5A. 

25 40. A machine-readable data storage medium comprising a data storage material encoded with 
a first set of machine readable data which, when combined with a second set of machine 
readable data, using a machine programmed with instructions for using said first set of data 
and said second set of data, can determine at least a portion of the structure coordinates 
corresponding to the second set of machine readable data, wherein: said first set of data 

30 comprises a Fourier transform of at least a portion of the structural coordinates selected 

from the group consisting of coordinates depicted in Appendix 1 or Appendix 2; and said 
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second set of data comprises an X-ray diffraction pattern of a molecule or molecular 
complex. 



41 . The machine-readable data storage medium of Claim 40 wherein said nuclear receptor is 
selected from the croup consisting of estrogen receptors, thyroid receptors, retinoid 

5 receptors, glucocorticoid receptors, progestin receptors, mineralocorticoid receptors, 

androgen receptors, peroxisome receptors and vitamin D receptors. 

42. The machine-readable data storage medium of Claim 41 wherein said nuclear receptor is 
an estrogen receptor. 

43. The machine-readable data storage medium of Claim 42 wherein said estrogen receptor is 
1 0 the estrogen receptor a. 



44. A cocrystal of a nuclear receptor comprising an agonist bound to the ligand binding 
domain and a molecule bound to the coactivator binding site of the nuclear receptor, 
wherein said crystal defracts with at least 2. 03 A resolution. 

45. The cocrystal of Claim 44 wherein said nuclear receptor is the estrogen receptor a. 
1 5 46. The cocrystal of Claim 45 wherein said estrogen receptor a is human. 

47. The cocrystal of Claim 46 wherein said molecule is peptide. 

48. The cocrystal of Claim 47 wherein said peptide comprises a NR-box amino acid sequence 
or derivative thereof. 

49. A cocrystal of a nuclear receptor comprising an antagonist bound to the ligand binding 
20 domain of the nuclear receptor, wherein said crystal defracts with at least 1 .9 A resolution. 

50. The cocrystal of Claim 49 wherein said nuclear receptor is the estrogen receptor a. 

51. The cocrystal of Claim 50 wherein said estrogen receptor a is human. 

52. A computational method of designing a nuclear receptor ligand where at least one amino 
acid residue of a nuclear receptor LBD that corresponds to human estrogen receptor a 

25 Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, 

Leu428, Gly521, His524, Leu525 and Met528, interacts with at least one first chemical 
moiety of said ligand. comprising the step of selecting at least one chemical modification 
of said first chemical moiety to produce a second chemical moiety with a structure to either 
decrease or increase an interaction between said interacting amino acid and said second 

30 chemical moiety compared to said interaction between said interacting amino acid and said 

first chemical moiety. 
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53. The method of Claim 52 wherein at least one amino acid residue corresponds to residues 
Met343. Met42L His524. Leu525 and Met528. 

54. The method of Claim 52 further comprising determining a change in interaction between 
said interacting amino acid and said ligand after chemical modification of said first 
chemical moiety. 

55. The method of Claim 52 wherein said chemical modification enhances hydrogen bonding 
interaction, charge interaction, hydrophobic interaction. Van Der Waals interaction or 
dipole interaction between said second chemical moiety and said interacting amino acid 
compared to said first chemical moiety and said interacting amino acid. 

56. The method of Claim 52 wherein said chemical modification reduces hydrogen bonding 
interaction, charge interaction, hydrophobic interaction. Van Der Waals interaction or 
dipole interaction between said second chemical moiety and said interacting amino acid 
compared to said first chemical moiety and said interacting amino acid. 

57. The method of Claim 52 wherein said nuclear receptor is selected from the group 
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid 
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors, 
peroxisome receptors and vitamin D receptors. 

58. The method of Claim 57 wherein said nuclear receptor is an estrogen receptor. 

59. The method of Claim 52 wherein the estrogen receptor is the estrogen receptor a. 

60. The method of Claim 59 wherein the ligand is an agonist. 

61. The method of Claim 60 wherein the ligand is selected from the group consisting of 17(3- 
estradiol, diethylstilbestrol, moxestrol, mesohexestrol, coumestrol, A 9 -THC, o,p-DDT, 
zearalenone and kepone. 

62. The method of Claim 61 wherein the ligand is 1 7p~estradiol, and the first chemical moiety 
is a free carbon of the A' ring located at a position selected from the group consisting of 
C6a, C7a. C 1 2a, C 1 5a, C 1 6a and C 1 7a. 

63. The method of Claim 59 wherein the ligand is an antagonist. 

64. The method of Claim 63 wherein the ligand is selected from the group consisting of ICI 
164,384 and EM800. 

65. The method of Claim 59 wherein the ligand is a selective estrogen receptor modulator. 

66. The method of Claim 65 wherein the ligand is selected from the group consisting of 
tamoxifen, raloxifene and GW5638. 
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67. A method of modulating nuclear receptor activity in a mammal by administering to a 
mammal in need thereof a sufficient amount of a ligand that fits spatially and preferentially 
into a ligand binding domain of a nuclear receptor of interest, wherein said ligand is 
designed by a computational method where at least one amino acid residue of a nuclear 
receptor ligand binding domain that corresponds to human estrogen receptor a Met343, 
Leu346, Ala350, GIu353. Leu384. Leu387. Leu39L Arg394. Phe404, Met42L Leu428. 
Gly521. His524, Leu525 and Met528, interacts with at least one first chemical moiety of 
said ligand. comprising the step of selecting at least one chemical modification of said first 
chemical moiety to produce a second chemical moiety with a structure to either decrease or 
increase an interaction between said interacting amino acid and said second chemical 
moiety compared to said interaction between said interacting amino acid and said first 
chemical moiety. 

68. The method of Claim 67 wherein at least one amino acid residue corresponds to residues 
Met343, Met42K His524, Leu525 and Met528. 

69. The method of Claim 67 wherein said ligand is an antagonist. 

70. The method of Claim 67 wherein said ligand is an agonist. 

71 . The method of Claim 70 which further comprises administering a coactivator mimic 
designed by a computational method where at least one amino acid residue of a nuclear 
receptor coactivator binding site that corresponds to human estrogen receptor a helix 3 
residues Leu354. Val355. Met357, Ile358, Ala361 and Lys362. helix 4 residue Phe367, 
helix 5 residues Gln375. Val376. Leu379 and Glu380, helix 6 residue Trp383. and helix 12 
residues Asp538. Leu539, Glu542, Met543 and Leu544, interacts with at least one first 
chemical moiety of said coactivator mimic, comprising the step of selecting at least one 
chemical modification of said first chemical moiety to produce a second chemical moiety 
with a structure to either decrease or increase an interaction between said interacting 
amino acid and said second chemical moiety compared to said interaction between said 
interacting amino acid and said first chemical moiety. 
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2.0 



<210> 1 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> BINDING 
<222> (1) . . (5) 

<223> residues 2 and 3 can be any amino acid 
<400> 1 

Leu Xaa Xaa Leu Leu 
1 5 



<210> 2 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> BINDING 
<222> (1) . . (5) 

<223> residues 2 and 3 can be any amino acid 
<400> 2 

Leu Xaa Xaa Met Leu 
1 5 
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<210> 3 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Leu Leu Gin Met Leu 
1 5 



<210> 4 

<211> 13 

<212> PRT 

<213> Homo sapiens 
<400> 4 

Lys His Lys lie Leu His Arg Leu Leu Gin Asp Ser Ser 
1 5 10 



<210> 5 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Lye Leu Pro Met 
1 5 10 15 

Phe Cys Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly CyB 
20 25 30 



Cys 



<210> 6 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
15 10 



<210> 7 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Lys Leu Pro Met 
15 10 15 

Phe Ser Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Cys Cys 
20 25 30 
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Cys 



<210> 8 
<211> 12 
<212> PRT 

<213> Homo sapiens 



<400> 6 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
15 10 



<210> 9 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Thr Lys Cys lie lie Lys lie Val Glu Phe Ala Lys Arg Leu Pro Gly 
1 5 10 15 

Phe Thr Gly Leu Ser lie Ala Asp Gin lie Thr Leu Leu Lys Ala Ala 
20 25 30 

cys 



<210> 10 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Pro Met Pro Pro Leu lie Arg Glu Met Leu Glu Asn 
15 10 



<210> 11 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Asp Lys Gin Leu Phe Thr Leu Val Glu Trp Ala Lys Arg lie Pro His 
15 10 15 

Phe Ser Glu Leu Pro Leu Asp Asp Gin Val lie Leu Leu Arg Ala Gly 
20 25 30 

Trp 



<210> 12 
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<212> PRT 

<213> Homo sapiens 
<400> 12 

Pro lie Asp Thr Phe Leu Met Glu Met Leu Glu Ala 
1 5 10 



<210> 13 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 13 

Val Glu Ala Val Gin Glu lie Thr Glu Tyr Ala Lys Asn He Pro Gly 
1 5 10 15 

Phe He Asn Leu Asp Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly 
20 25 30 

Val 



<210> 14 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys Asp 
15 10 



<210> 15 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 15 

Ser Tyr Ser He Gin Lys Val He Gly Phe Ala Lys Met He Pro Gly 
15 10 15 

Phe Arg Asp Leu Thr Ser Glu Asp Gin He Val Leu Leu Lys Ser Ser 
20 25 30 

Ala 



<210> 16 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 16 

Lys Leu Thr Pro Leu Val Leu Glu Val Phe Gly Asn 
15 10 
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<210> 17 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 17 

Asp Arg Glu Leu Val His Met lie Asn Trp Ala Lys Arg Val Pro Gly 
1 5 10 15 

Phe Val Asp Leu Thr Leu His Asp Gin Val His Leu Leu Glu Cys Ala 
20 25 30 

Trp 



<210> 18 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Pro Leu Tyr Asp Leu Leu Leu Glu Met Leu Asp Ala 
1 5 10 



<210> 19 
<211>, 33 
<212> PRT 

<213> Homo sapiens 
<400> 19 

Gly Arg Gin Val lie Ala Ala Val Lys Trp Ala Lys Ala lie Pro Gly 
15 10 15 

Phe Arg Asn Leu His Leu Asp Asp Gin Met Thr Leu Leu Gin Tyr Ser 
20 25 30 

Trp 



<210> 20 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Glu Phe Pro Glu Met Leu Ala Glu lie He Thr Asn 
15 10 



<210> 21 

<211> 33 

<212> PRT 

<213> Homo sapiens 



<400> 21 
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Glu Arg Gin Leu Leu Ser Val Val Lys Trp Ser Lys Ser Leu Pro Gly 
15 10 15 

Phe Arg Asn Leu His lie Asp Asp Gin lie Thr Leu lie Gin Tyr Ser 
20 25 30 

Trp 



<210> 22 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Glu Phe Pro Glu Met Met Ser Glu Val lie Ala Ala 
15 10 



<210> 23 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Gly Lys Gin Met lie Gin Val Val Lys Trp Ala Lys Val Leu Pro Gly 
15 10 15 

Phe Lys Asn Leu Pro Leu Glu Asp Gin lie Thr Leu lie Gin Tyr Ser 
20 25 30 

Trp 



<210> 24 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Glu Phe Pro Ala Met Leu Val Glu lie lie Ser Asp 
15 10 



<210> 25 

<211> 33 

<212> PRT 

<213> Homo sapiens 



c400> 25 

Glu Arg Gin Leu Val His Val Val Lys Trp Ala Lys Ala Leu Pro Gly 
15 10 15 

Phe Arg Asn Leu His Val Asp Asp Gin Met Ala Val lie Gin Tyr Ser 
20 25 30 
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Trp 



<210> 26 

<211> 12 

<212> PRT 

<213> Homo sapiens 

c400> 26 

Asp Phe Pro Glu Met Met Ala Glu lie lie Ser Val 
1 5 10 
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